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IDENTITIES  OF  THE  ROGERS-RAMANUJAN  TYPE 

By  W.  N.  BAILEY 
[Received  28  April  1944—jReod  15  June  1944] 

1.  In  a  recent  paper*  I  gave  a  method  of  obtaining  transformations  of 
basic  hypergeometric  series  which  gave  the  most  general  known  trans- 
formations of  such  series  as  well  as  a  new  transformation  of  a  nearly  poised 
basic  series.  The  method  also  led  to  new  identities  of  the  Rogers-Ramanujan 
type.  This  method,  combined  with  a  further  study  of  Rogers's  work,  has 
led  me  to  a  considerable  simplification  in  the  ideas  underlying  the  methods 
of  finding  transformations  of  hypergeometric  series,  and  also  identities  of 
the  Rogers-Ramanujan  type,  which  are  essentially  limiting  cases  of  trans- 
formations of  basic  series.  The  simple  fundamental  result  is: 

If  Pn  =   S  ar^-r<W> 

r-O 

00 

and  yn  =  £  <*r*V-n<W 

r»w 

then  S««r»=  SAA- 

w=»0  n»0 

The  proof  is  almost  trivial.  In  fact, 


n*=0  r—n 

=  S     S  <* 
r=0  n-0 

oo 


We  assume,  of  course,  that  the  series  converge  and  that  the  change  in 
the  order  of  summation  is  allowable.  In  all  the  cases  given  in  this  paper  the 
justification  would  be  simple. 

*  Bailey  (2). 
SBB.  2.    VOL.  50.    NO.  2379.  A 
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The  above  result  could  be  used  to  work  out  all  the  most  general  trans- 
formations of  generalized  hypergeometric  series  of  the  ordinary  type.  In 
fact,  the  method  of  finding  transformations  of  terminating  series,  given  in 
chapter  rv  of  my  tract,*  is  really  a  particular  case  of  the  above  result,  the 
numbers  an,  un,  vn,  dn  being  chosen  so  that  the  series  for  fn  and  yn  can  be 
summed  by  known  (and  comparatively  elementary)  results.  Similarly,  the 
method  given  in  my  paper (2)  for  finding  transformations  of  basic  series  is 
really  a  particular  case  of  the  above  result,  the  numbers  an,  un,  vnt  Sn  being 
chosen  so  that  the  series  for  pn  and  yn  can  be  summed  by  known  sums  of 
basic  hypergeometric  series.  In  the  case  of  series  of  the  ordinary  type  it 
was  only  necessary  to  assume  the  theorems  of  Saalschiitz  and  Dixon  for 
the  sums  of  certain  series  ZFZ,  but  in  working  with  basic  series  I  assumed 
immediately  a  knowledge  of  Jackson's  basic  analogue  of  Dougall's  theorem. 
It  may  not  be  necessary  to  assume  so  much,  but  the  proof  of  Jackson's 
identity  is  quite  elementary,  and  it  hardly  seemed  to  be  worth  while  to 
avoid  assuming  it,  particularly  as  there  is  no  exact  counterpart  of  Dixon's 
theorem  for  basic  series,  and  the  analogue  of  Rummer's  theorem  is  not 
nearly  so  well  known  as  Rummer's  theorem  itself. 

2.   The  form  in  which  we  use  the  result  of  §  1  is: 

n  ~ 

if  Pn  =  so  Wn 

and  Tn  =  S   7-T 


w—O  w=0 

Here  I  use  the  notation  of  my  previous  paper, 

(a)n  =  (1  -a)  (1  -ax)  (I  -ax2)  ...  (1  -aa?"-1), 
and  later  I  use  the  abbreviations 

xn  =  1  -xn,     xn  =  1  +  xn,     xn\  =  (x)n,     xn\\ 


[a]n  =  (1  -a)  (1  -a*2)  ...  (1  -ax**~*)t    {a}n  =  (1  -a)  (1  -ax*)  ...  (1 
3.   The  formula  for  yn  can  be  written  as 


*  Bailey  (1). 
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Taking  8r  =  (p^)r  (p2)r  (ax/pip^Y,  we  can  sum  the  series  for  yn  by  the  basic 
analogue  of  Gauss's  theorem,  and  we  find  that 

n 


U' 

With  these  values  of  yn  and  dn,  the  result  of  §  2  gives 


iUp«)»  /^  \*    /3. 

i)»(«»/Pa)n  "' 


This  formula  has  an  interesting  connexion  with  Rogers's  work.  Following 
Rogers,  we  define  An(6)  by  the  identity 


oo      A     (f)\  oo 

S—  ^V^53  II  (l- 


and  then 


^  2cos50+ 


...). 

/ 


Thus,  if  we  take  a0  =  1,  ar  =  2  cos2r#,  and  a  =  1,  then  /?w  =  -4.2W 
while  if  we  take  ar  =  2  cos  (2r  +  1)  ^  and  a  =  #,  then  >^n  =  ^L2n 
We  therefore  obtain,  from  (3-1),  the  identities 


n-O  .  ». 


ss     II      I I 

9.     (3-3) 


i)n  \PiP* 

A2 
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If  we  replace  plt  p2  by  at  fa,  x*/v  in  (3-2)  we  get  §  5  (  1  )  of  Rogers's  paper  (6), 
while  if  we  replace  pv  p2  by  x/u,  x/v  in  (3-3)  we  get  §  5  (2)  of  the  same  paper.* 
From  these  two  formulae  Rogers  derived  a  list  of  twenty-one  particular 
cases  which  he  used  to  find  identities.  It  will  be  noticed  that  these  for- 
mulae are  contained  in  (3-1)  which  is  obtained  by  a  simple  application  of 
the  analogue  of  Gauss's  theorem. 

4.   If,  instead  of  using  the  analogue  of  Gauss's  theorem,  we  use  the 
analogue  of  Saalschiitz's  theorem,  we  can  take 

,  _  (A),  to),  (*-"),,, 


where  N  is  a  positive  integer,  and  we  find  that 


^  (-l)M(p1)n(/og)lt(ar-^)n  /  ax  \n 

(WC)N  (MCIPlPl)N   (o»/Pl)n  (axIPz)n  (aXN+l  )n  \PlpJ 

These  values  of  $r,  yn  reduce  to  the  previous  values  when  N  ->  oo,  and  lead 
to  more  general  results  involving  only  terminating  basic  series.  We  are, 
however,  more  concerned  with  identities  of  the  Rogers-Ramanujan  type 
in  this  paper,  as  the  most  general  formulae  for  basic  series  (apart  from  those 
already  given)  are  too  involved  to  be  of  any  great  interest. 

5.  Another  useful  set  of  values  for  ynt  dn  can  be  obtained  by  using  the 
formula 


[7,  g;    a!  _    «   Rl  -ax^lf)(l-ax«'lg)-\fg(l  +  a)-( 

1  'I**  jy  ~JiL  a  -«^>  a  -«*"•/&)  J       #-        '  ( 


instead  of  the  analogue  of  Gauss's  theorem.  f  This  can  be  obtained  from 
§  8-5  (2)  of  my  tract  by  letting  c,  e-^oo  and  putting  d  =  l/q.  It  can  also  be 
obtained  by  using  the  simple  identity 

A  W>  9>  al_  <D  |7'  y~>  "*"|  i  (*  -/Hi-?)  «  *  [fa,  a*;  £l 

11  L  ew   fy]  ~  A  L  ™    Tg\  l-o«       fff*    1  1   a*1     /d  ' 

and  summing  the  two  series  on  the  right  by  the  analogue  of  Gauss's  theorem. 

We  thus  find  that  if       dr  =  (p^  (p2)r  (alp^Y, 

then      y  to)n(ft)n       /  a   \MpiP»(1+««i>)-«»g>(Pi  +  Pin 

7n      (ax/p1)n(ax/p2)n\p1pj    L  PiP2-«  J 


*  The  formulae  §5(1)  and  §  5  (2)  of  (6)  are  actually  proved  in  (5). 
t  This  formula  was  pointed  out  to  me  by  Mr  F.  J.  Dyson. 
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This  simplifies  when  />2->oo,  in  which  case  we  get  the  set  of  values 


. 

J 


6.   The  relation  giving  fin  in  terms  of  the  a's  is  practically  one  that  was 
considered  in  my  previous  paper.  We  can  write 


ft     _  _  _  _    v     v         '**    /       I  \r  ~$r(2n-r+l)  ~ 

^  ~  (5u5£jrnr?,  W^v    } 

and  we  know  from  that  paper  that  there  are  several  ways  of  choosing  ar  so 
that  the  series  for  fln  can  be  summed  by  Jackson's  identity.  It  is  perhaps 
worth  noting  that  if  we  take 


we  get  fln  =  ,,.  & 

*  Pn      (x)n(ax)n*    l 

We  cannot  choose  y  independent  ofn  so  that  the  series  20±  can  be  summed 
by  the  analogue  of  Gauss's  theorem,  whereas  for  series  of  the  ordinary  type 
we  can  use  Gauss's  theorem  and  obtain  a  transformation  of  a  nearly  poised 
4^3  into  a  Saalschiitzian  5^4.*  Thus,  as  I  stated  before,  there  appears  to  be 
a  breakdown  in  the  analogy  between  basic  series  and  series  of  the  ordinary 
type  for  nearly  poised  series  expressed  in  terms  of  Saalschiitzian  series. 

In  §  8  of  my  previous  paper  we  found  the  following  sets  of  values  of  an,  fin: 


(i)  a0  =  1,     an  =  (  -  1^  (1  -(W)        s=4o"«»«»»-»f     fl,  =  ~. 

jcn.  jc,n\ 

(ii)  Changing  a  into  a2,  x  into  x2, 


(iii)  Changing  a  into  a3,  x  into  a3, 

«      _  /        l\n/l  ' 

«,-(-!)   (1- 


*  Actually  §  4-6  (  1)  in  my  tract. 
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Also  from  §  10  of  the  same  paper, 
'0,     azn 


***      xn\[ax]n(ax/f)n' 
xn\ 

Pn  = 


From  (i)  and  (3-1)  we  get  merely  a  limiting  case  of  Watson's  transforma- 
tion from  which  the  actual  Rogers-Ramanujan  identities  can  be  deduced. 
From  the  other  sets  of  values  we  get,*  with  (3-1), 


V          l2n-2. 

* 


l    (  1  -  «2^2w)  (1  -  aVm/Plp2) 

v  fl  4-    V    (^)n-l  (^)n  ^lln  !/>«]»  C1  -^2")«3^8^"|        ,6. 
L         »-l    ^nK^/^^^VpJnt^VPJ^PlPl*)11   J' 


~) 

^  ^^WP^  (6.2) 

n-ixn\{a?x*lpi}n{a?x*lp^n  \pipj  J 


ml     (  1  -  «^W)  (  1  -  ^m/PlP2) 

x  fl  4-   V    ^2]^-l  [/ln( 
L        «tfl       ^l[^2//L 

V    (Pl)n(P2)n{^3)n-  n 


X  fl  4.    V    (  -  ^  ^^-^  (1 
L         n-1  *Zl(**IPl 


*  When  Pi.pa-*00*  th©  formulae  (6-l)-(6-4)  become  the  same  as  (8-3),  (8«4),  (10-1} 
and  (10-6)  of  (2). 
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Similarly,  we  can  combine  the  sets  of  values  of  an,  ftn  with  (5-2).  It  will 
suffice  to  give  the  result  obtained  from  the  set  (v),  namely, 

1      g.  (-l 


Pi 


(3n-in 

x  (  1  +  a#6w  -  <*a?*lpi)  -  ^  -    .     (6-5) 

From  these  formulae  many  identities  of  the  Rogers  -Ramanuj  an  type 
can  be  derived  by  choosing  the  parameters  suitably.  For  example,  from 

(6-1),  if  we  let  plf  b  ->  oo,  and  put  p2  =  —  a#,  we  get 


->    ,     (6-6) 
and  for  ct  =  1  this  gives 

00         "-  ""I 

(6'7) 


which  is  one  of  the  more  elegant  sums  given  by  Rogers.* 

The  formulae  (6-1)  and  (6-3)  give  many  known  results,  and  I  therefore 
consider  only  the  formulae  (6-4)  and  (6-5).  These  give  all  the  formulae 
labelled  B,  C,  D  and  E  in  the  list  given  in  §  11  of  my  previous  paper  (and 
found  by  F.  J.  Dyson). 

For  pl9  p2-+oo  in  (6-4),  we  get  (B4)  when  a  =  1,  and  (Bl)  when  a  =  #3. 

For  />2->oo,  /0J  =  —*J(ax),  we  get  (C3)  when  a  =  1,  and  (D  1)  when  a  =  a3. 

Forp!  =  —  <ja,  p2->oo,  a  =  a;8  gives  (Cl),  and  a  =  1  gives  (D3). 

For  p±  =  J(ax),  p2  ->  oo,  a  ==  #3  gives  (El). 

From  (6*5),  for  pj^oo,  we  get  (B2)  when  a  —  x3,  and  the  sum  of  (B3) 
and  (B4)  when  a  —  I. 

For  PJ  =  —^J(ax),  we  get  a  linear  combination  of  (Dl)  and  (D2)  when 
a  =  x*. 

For  p±  =  —  *Ja,  we  get  (02)  when  a  =  #3,  and  a  linear  combination  of 
(D2)  and  (D3)  when  a  =  1. 

For  pj  =  ^(ax),  we  get  a  linear  combination  of  (E  1  )  and  (E  2)  when  a  —  x*. 

*  Rogers  (4,  last  formula  on  p.  330).  See  also  Jackson  (3,  175),  where  a  different 
generalization  of  (6-7)  is  given. 
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In  the  course  of  the  work  we  naturally  obtain  generalizations  of  these 
formulae  involving  a  parameter  a.  The  formula  (B  3),  namely 


(6'8) 


is  a  particularly  elegant  one,  and  so  I  give  an  ^-generalization  which  I  found 
rather  troublesome  to  obtain. 

As  indicated  above,  we  let  pv  /o2->oo  in  (6-4)  and  (6-5)  and  we  get 

1+  S 


-Ate 


Subtracting,  we  get 

V          {^)n-l       an^-.^    ^    /_  L 

».  i  ^n_!  !  (as)an-i  «"  i  \  1  -  « 

~  1  )n  {^3}»-i     4Mi(a7n«- 
7l! 

S  (- 


a-  y  _  n    4 

n-o         ^! 


It  follows  that 
2 


S  —  -  "1  (  1  -  a2*12n)  g*^^^8-^)    . 


S  (-^-^^^  (6-9) 

n=0  ^n' 

This  gives  (6-8)  when  a  =  1,  and  it  is  the  most  elegant  generalization  of  that* 
formula  which  I  have  been  able  to  find. 
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7.  It  has  not  been  my  object  to  find  a  large  number  of  new  identities  of 
the  Rogers-Ramanujan  type,  but  it  is  perhaps  worth  while  noting  some 
which  I  believe  are  new.  Thus  from  (6-4),  by  taking  pt  =  —  ^ja,  p%  =  —^/(ax)t 
^Ja  =  x*  or  1  ,  and  finally  changing  x  into  a;2,  we  obtain  the  formulae 

V      *******     -  TT  (1  ~a?18n)  (1  -a?1871"3)  (1  -a 

«       --i 


- 

nt'l^n-l!^!  "nl    (1  -*-)  (1  -0^) 

-  (l+^Hl-a*") 

-9*'  ^        ' 


Some  results  from  (6-2)  are 


(<7. 
\' 


Some  further  identities  communicated  to  me  by  Mr  Dyson  are 

oo    ,£3na,£2    j  oo     Q  _  /j.l2w—  5\  Q  _  /j.l2n—  7\ 

^^,121^3    I  ~    Al  1~_  ~ 

n=*0xn  'X2n'        n—  1  1  • 


-!  _ 

.-ia4law.il      .^(l+a^-^Xl-a^-1)  (!-«*•)'  l/    ' 

oo    /  _  i  \n  /j-i^+n)/*.          f          <» 

V     A^   -   -   ^sn+i-  ^ 


w,  (7-8) 

=0  2n+l  w«0  n~l 

_  T  \n  ^.f<na+n)r      f/i  _  ^3n+2\ 

_iL£  -     3(          -  -j 


^.          (7-9) 

n=*0  2n+l-  n«0  n-1 

Of  these,  (7-6)  and  (7-8)  can  be  derived  from  (6-2),  and  no  doubt  the  others 
can  also  be  obtained  from  the  formulae  of  this  paper. 

8.   I  now  give  a  final  application  of  the  simple  result  of  §  1. 
Define  the  differences  Dn(a0)  by  the  relations 

D(aQ)  =  ax-a0, 


sphere  nl  =  (l-q)(l-q2) ...  (l- 


10  W.  N.  BAILEY 

Then  Dn(a0)/nl  is  the  coefficient  of  xn  in  the  product 


that  is  in  n  (l-tfte)  S      r  • 

n-O  ' 

Itfollowathat 


and  equating  the  coefficients  of  xn  on  the  two  sides  we  get 


Now  let  /(a?)  =      &X,    f'(x)  =     x 

o 

Taking  ar  =  (af/r!)Dr(a0),  ^  =  6rr!,  ur  =  a;r/r!,  vr  =  1  in  §  1,  we  find  from 
(8-1)  that  /?n  =  anxn/n\3  and  evidently  yn  =  f(n\x).  It  follows  that 


This  result  was  given  by  Jackson  and  used  by  him*  to  obtain  numerous 
identities.  * 
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1.    Introduction 

In  a  previous  paperf  I  gave  a  method  by  which  the  spectral  resolution 
corresponding  to  a  self-adjoint  operator  can  be  deduced  from  the  solution 
of  the  Schrodinger  equation  for  that  operator.  Unlike  other  proofs  of  the 
existence  of  the  spectral  resolution!  this  proof  did  not  depend  on  the  use 
of  the  resolvent  operator  or  on  operators  which  are  effectively  real  parts 
of  resolvent  operators;  but  the  proof  of  a  solution  of  a  Schrodinger  equation 
corresponding  to  a  self-ad  j  oint  operator,  given  in  theorem  II  of  my  paper,  does 
depend  on  the  use  of  the  resolvent  operator.  The  first  object  of  this  present 
paper  is  to  provide  a  proof  of  this  existence  theorem  which  does  not  depend 
on  the  resolvent  operator.  Taken  together  with  the  paper  referred  to  above, 
this  section  of  the  present  paper  should,  therefore,  be  of  interest  as  giving 
a  new  approach  to  the  problem  of  the  self -adjoint  operator. 

The  proof  here  given  shows  that  a  solution  of  the  Schrodinger  equation 
for  the  adjoint  operator  (eqn.  (1))  exists  with  any  element  of  the  domain  of 
the  operator  as  initial  value,  and  that  this  is  true  for  all  symmetric  operators. 
This  result  can  be  made  a  basis  for  a  discussion  of  the  properties  of  general 
symmetric  operators:  but  the  discussion  in  this  paper  is  confined  to  self- 
adjoint  and  maximal  symmetric  operators. 

For  self-adjoint  operators,  the  solution  of  the  Schrodinger  equation 
appears  as  the  transform  of  the  initial  value  by  a  unitary  operator:  the 
totality  of  these  unitary  operators  forms  a  one-parameter  continuous  group. 
This  is,  of  course,  well  known:  the  novel  feature  of  the  treatment  here  is 
that  the  result  is  demonstrated  directly,  without  the  intervention  of  the 
spectral  resolution,  so  that  the  discussion  of  the  group  of  one-parameter 
unitary  operators  is  the  basis  of  the  discussion  of  the  spectral  resolution, 
instead  of  vice  versa  as  in  previous  discussions. § 

For  operators  which  are  maximal  but  not  self-adjoint,  a  solution  of  the 
Schrodinger  equation  for  the  operator  exists  for  every  initial  value  either  for 

t  Ref.(lO).  J  See  refs.  (1,  2,  3,  4).  §  See  refs.  (6,  7,  8). 
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all  positive  or  for  all  negative  t,  according  to  the  class  of  the  operator,  but 
not  for  both.  The  solution  is  an  isometric  transformation  of  the  initial 
values;  the  set  of  isometric  transformations  forms  a  one-parameter  con- 
tinuous half  -group.  By  this  we  mean  that  the  set  has  the  multiplicative 
properties  of  a  one-parameter  group,  but  not  the  property  that  any  member 
possesses  an  inverse  operator  with  a  domain  dense  in  Hilbert  space.  The  study 
of  this  set  of  isometric  transformations  leads  to  a  new  theory  of  maximal 
operators,  and  a  new  type  of  resolution  of  the  identity  is  shown  to  corre- 
spond to  these  operators.  The  resolution  of  the  identity  given  by  von 
Neumann  and  Stone  for  self-adjoint  operators  has  the  property  that  if  the 
operator  has  a  simple  spectrum  the  Hilbert  space  is  mapped  on  L\  —  oo,  oo) 
in  such  a  manner  that  /(A)  of  L\  —  oo,  oo)  is  carried  by  the  operator  into 
A/(A).  The  resolution  of  the  identity  given  here  has  the  property  that  in  the 
case  of  an  irreducible  maximal  symmetric  operator  of  deficiency  index 
(0,  1)  the  Hilbert  space  is  mapped  on  to  £2(0,oo)  in  such  a  manner  that 
f(t)  of  L2(0,  oo)  is  carried  by  the  operator  into  i(df/dt). 

The  close  association  between  the  Schrodinger  equation  and  the  spectral 
theory,  in  this  and  my  previous  paper,  has  some  merits  from  the  physical 
point  of  view.  Whereas  the  idea  of  a  symmetric  operator  derives  in  quantum 
theory  from  a  physical  requirement  —  that  the  values  of  the  corresponding 
observable  must  be  real  —  the  distinction  between  self  -ad  joint  and  other 
symmetric  operators  seems  to  have  no  physical  meaning.  On  the  other  hand, 
the  hypothesis  of  the  first  theorem  of  my  last  paper,  that  the  Schrodinger 
equation  has  a  solution  for  all  values  of  the  variable  it  has  the  physical 
meaning  that  the  system  can  have  an  infinite  past  and  future,  while  the 
variable  corresponding  to  the  operator  retains  a  constant  probability  dis- 
tribution. If  the  Schrodinger  equation  has,  for  example,  no  solution  for 
sufficiently  large  positive  t,  then  the  system  can  develop  for  only  a  finite 
time  with  the  corresponding  observable  constant.  A  simple  example  is  a 
particle  moving  with  constant  momentum  on  a  semi  -infinite  line;  if  it  is 
moving  to  the  boundary,  its  motion  can  continue  only  for  a  finite  time. 
The  corresponding  Schrodinger  equation  has  solutions  for  all  negative,  but 
not  all  positive  J.f 

1  d 

f  The  operator  here  is  T  —  -  ,  defined  for  all  absolutely  continuous  functions  which, 
^  dx 

with  their  derivatives,  are  in  L2(Q,  oo)  and  vanish  for  x  =  0.  The  solution  of  (1  A)  is, 
for 


For  positive  t  a  solution  is  possible  only  for  those  values  of  t  with  <f>(x)  =  Q  (x<t).  The 
reading  of  this  paper  will  be  facilitated  if  the  reader  considers  the  application  of  the 
arguments  to  this  example,  which  is  further  discussed  in  the  last  section. 
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Through  the  work  of  Koopman,f  these  considerations  can  be  extended 
to  classical  mechanics.  Koopman  has  shown  that  systems  with  constant 
energy  can  have  their  motion  represented  by  a  one-parameter  group  of 
unitary  operators.  Here  again,  one-parameter  half-groups  of  isometric 
operators  would  appear  to  correspond  to  systems  which  for  some  initial 
conditions  can  have  only  a  finite  future,  or  a  finite  past. 

Just  as  the  theory  of  groups  of  unitary  operators  corresponds  to  that  of 
self  -adjoint  operators,  and  has  been  worked  out  as  a  separate  problem  by 
von  Neumann  and  Stone,  J  so  there  will  be  a  theory  of  half  -groups  of  isometric 
operators  corresponding  to  the  theory  of  maximal  operators  in  this  paper. 
From  this  point  of  view,  the  theory  of  this  paper  is  confined  to  half  -groups 
for  which  the  operators  are  differentiable  functions  of  the  parameter.  It 
remains  to  be  investigated  how  far  the  hypothesis  of  differentiability  can 
be  removed. 

A  final  section  of  the  paper  gives  some  applications  to  the  theory  of  Hilbert 
transforms  and  Watson  transforms,  etc.,  which  should  serve  as  examples. 

Except  where  the  contrary  is  stated,  the  terminology  and  notation  of 
this  paper  follow  that  of  ref.  (2). 

2.    Preliminary  remarks  on  differential  equations  in  Hilbert  space 
In  what  follows  we  shall  be  concerned  with  equations  of  form 


where  fr(t),  %(£)  are  functions  of  a  real  variable  t  whose  values  are  elements 
in  a  Hilbert  space  §.  In  the  cases  to  be  encountered,  %(£)  is  a  linear  trans- 
formation of  ty(t).  The  left-hand  side  is  a  differential  coefficient  of  t/f(t)t 
the  limit  of  [i/r(t  -4-  h)  —  i/r(t)]/h.  This  limit  may  exist  either 

(i)  in  the  strong  sense  —  in  which  case  we  shall  talk  of  the  strong  differ- 
ential coefficient,  denoted  by  d^fjdt,  and  of  the  differential  equation  being 
obeyed  in  the  strong  sense;  or 

(ii)  in  the  weak  sense  —  in  which  case  we  talk  of  the  weak  differential 
coefficient,  etc.,  < 


t  Ref.  (9).  %  See  refs.  (6,  7,  8). 

§  The  strong  sense  is  the  more  natural  one  in  Hilbert  space  theory,  but  the  weak 
sense  corresponds  more  closely  to  the  normal  interpretation  of  a  differential  equation 
—  e.g.  if  we  consider  the  Hilbert  space  L2(  —  00,00),  and  ifr(t)  =  fi(x,t)  is  an  element 
of  the  space  for  each  t,  the  equation 


would  be  interpreted  to  mean  that  the  differential  coefficient  is  equal  at  each  point  x 
to  x(x*  0  —  which  is  practically  the  weak  sense. 
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If  <f>n  is  a  complete  normal  orthogonal  set  in  §,  and 


the  weak  sense  of  the  equation  is  equivalent  to  the  condition  that  for  all 

n  and  t  ,  , 


More  than  this  is  required  by  the  strong  sense  of  the  equation,  but,  as  we 
shall  see  from  the  following  lemmas,  the  difference  between  the  two  senses 
is  less  important  than  might  be  expected. 

LEMMA  A.  //  x(t)  ^8  bounded,  ||  x(t)  \^K  for  all  t  in  (a,  6),  then  for  all  t 
except  a  set  of  measure  zero 

i  rt+h 
limy         \\x(y)-a\\*dy=\\X(t)-a\\* 

h-+Q  n  J  t 

for  every  element  oc  in  §. 

The  proof  of  this  theorem  is  essentially  the  same  as  that  of  §  11-6  of  ref. 
(12).  For  a  fixed  a,  we  have 

i   rt+h 

Km.          \\X(y)-^dy^\\X(t)-oL\\\ 
h-+o  "  J  t 

except  at  a  set  of  points  of  measure  zero,  by  §11»53  of  ref.  (12).  Since 
Hilbert  space  is  separable  we  can  find  a  countable  set  of  elements  {/?}  which 
are  everywhere  dense  in  £).  The  equation  above  is  then  true  for  all  the  /? 
except  at  a  set  of  points  t  of  measure  zero  —  the  sum  of  the  exceptional  sets 
for  individual  ft.  Given  any  a  in  $$  and  e  >  0,  we  can  find  a  fl  such  that 
||  a  -  /?  ||  <  e.  Then  for  alH 


so  for  some  K  independent  of  t 


,  f«+*.. 

Hence 


I  rt 

\jt 


and  so,  if  t  is  not  in  the  exceptional  set  for  the  {/?}, 
rt+h 


since  e  may  be  as  small  as  we  please,  the  result  follows  for  any  a  in 
COROLLARY.  In  particular,  for  almost  all  t 


rt+h 
J( 
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LEMMA  B.  If  for  all  t  in  (a,  6) 


15 


then  the  equation  is  satisfied  in  the  strong  sense  for  almost  all  t  —  in  fact,  for 
all  t  save  those  in  the  exceptional  set  of  the  corollary  to  lemma  A. 
We  have 


-x(t) 


t+h 


st 


for  all  t  save  for  the  exceptional  set  of  lemma  A  corollary. 

LEMMA  C.  //  x(t)  ^s  a  continuous  function  of  t,  then  the  conclusions  of 
lemmas  A  and,  B  are  true  everywhere. 

This  is  a  consequence  of  the  fact  that  a  continuous  function  is  everywhere 
equal  to  the  derivative  of  its  integral. 

LEMMA  D.  //  in  an  interval  (a,b) 


then  for  all  t  in  (at  b) 
d 

We  have 
1,, 


{(VM<  +  *)  -  f i  (0, 
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The  second  bracket  is  less  in  absolute  value  than 


since  ^—^  -  ^  —  i-lLJ  (i  =  1,2)  are  bounded  because  they  are  weakly  con- 
fit 

vergent.  The  first  bracket  tends  to 


3.    The  existence  theorem  for  the  Schrodinger  equation 
for  symmetric  operators 

THEOREM  I.  If  H  is  a  linear  symmetric  operator  defined  in  the  domain  *%) 
of  the  Hilbert  space  §,  and  H*  is  its  adjoint  defined  in  the  domain  '£)*,  then  for 
every  element  (f>  in  3)  there  exists  for  all  t  an  element  i/r(t)  in  &*  such  that 
exists  and 

0.  (1) 


The  following  lemma  will  be  useful: 

LEMMA  1  .  //  i/r(t)  and  x(t)  w&  in  the  domain  ®  of  a  symmetric  operator  H 
for  all  tin  a  range  (a,  6),  are  weakly  differentiate  functions  oft  in  that  range, 
and  satisfy 


then  (ft(t)>  x(t  +  T))  is  independent  of  t  as  long  as  t  and  (t  +  r)  lie  inside  (a,  6). 
From  lemma  D, 


0. 


The  lemma  follows  immediately. 

Proof  of  theorem  I.  Let  a  sequence  of  closed  linear  manifolds  Wln  be  chosen 
so  that  9ftn+1  contains  9D£n,  so  that  the  closed  linear  manifold  determined  by 
the  upper  limit  of  the  $Rn  is  ^,  and  so  that  H  is  bounded  inside  each  one  of 
the  $Jln.  This  is  possible  because  the  domain  of  H  is  dense  in  $. 

Let  EN  be  the  projection  operator  corresponding  to  Uft^.  Consider  the 
equations 

(2) 
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The  operator  ENHEN  is  bounded  and  self-adjoint.  The  solution  of  (2)  can 
be  written  down,  formally, 

OO       /  *4\"n. 

(3) 


Let  the  bound  of  H  in  SDt^  be  .STy,  i.e.  in  < 


Then  for  all  <j>  ||  (ENHENy<[>  || 

and  the  convergence  of  (3)  follows,  for 


r!       5! 


-  ((ENHENY^  (ENHENY4>) 


0     as     m-^-oo. 


The  series 


also  converges  uniformly  over  any  interval  in  £.  It  is  easily  seen  to  be  equal 
to  both  --^  and  (ENHEN)\jf.   Since,  in  addition,  the  series  (3)  satisfies 

1    Cut  + 

i/rN(0)  =  (f>,  we  see  that  the  function  denned  by  it  is  a  solution  of  (2). 

dfrN/dt  and  dzi/rN/dt2   are   defined  by  similar  series,   and  satisfy  the 
differential  equation  (2)  with  the  boundary  conditions 


\(ENHENY<t>\\. 


Thus,  from  lemma  1,  we  have 


11  YNVI  ii       ii  v  ii  >          fa 
If  g  is  any  element  of  % 


dt3 


dt\~d 


0\ 

'9) 


8KB.  2.      VOL.  50.      NO.  2380. 


(4) 
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Let  {0n}  be  a  set  of  complete  normal  orthogonal  elements  of  $?,  all  lying 
in  ^).  The  manifold  ^QIN  may  be  taken  to  be  the  linear  manifold  determined 
by  all <fin  with  n^N.  Since  the  <[>n  are  to  a  large  extent  arbitrary  there  is  no 
loss  of  generality  if  we  assume  <f>  to  be  one  of  the  <f>n,  or  indeed  0r 


From  (4) 


d  /di/rN 
dt  \~di~ 


if  N  is  large  enough  (N^n).  This  last  expression  is  independent  of  N,  so 
for  a  fixed  n  the  sequence  (di/rN/dt,  <pn)  as  N  varies  from  1  to  oo  consists  of 
uniformly  continuous  functions  of  t. 

By  a  well-known  theoremf  on  sets  of  uniformly  continuous  functions, 
it  is  possible  to  choose  from  the  sequence  (d^jdt,^)  a  subsequence 
(dftNl  m/d2j^i)  which  tends  to  a  continuous  function  of  t  as  Nltm-*-co. 

From  the  sequence  Nl>m  we  can  choose  a  subsequence  Nz>m  such  that 
(d&Nt  m/dt,<f><i)  tends  to  a  limit  as  N2  jOT->oo,  and  so  on  for  ^3,  etc.  Then  if 
p  runs  through  the  numbers  Nm>m  the  sequence  (di/fp/dt,(fin)  tends  to  a 
continuous  function  of  t  as  p  ->oo  for  any  fixed  n. 


Write 

Now  ,  #.    =  i(EfHEr 


if  p  is  large  enough,  since  0  is  one  of  the  (f>n  ; 


so  there  exists  a  function  %(£)  such  that 


The  sequence  (dfrp/dt,  (j>n)  is  a  uniformly  bounded  sequence  of  functions 
of  t  which  tend  to  a  limit  hn(t)  ;  hence  the  integrals  of  the  functions  of  the 
sequence  tend  to  the  integral  of  their  limit: 


t  See  Courant  and  Hilbert,  Methoden  der  Mathematischen  Physik,  1  (1931),  49. 
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or  MO  dt  =  lim  WP(t)  -$,  #,). 

Jo 

Since  S  I  I*  X(T)<*T  2  <  *S  P|  *n(r)  \*d 


there  exists  an  element  in  ^>  defined  by 


and 

that  is,  ^r(£)  is  the  weak  limit  of  ^rp(t)  when  p  runs  through  the  sequence  of 
numbers  Nmm.  Clearly 


Since  ftp(t)^>i/f(t)  weakly,  for  every  g  in  ® 

For  all  p  |  (i/fp(t),Hg)  \  =  \  (Hi/fp(t),g)  \ 


so  that 

for  all  g  in  ®.  y^(^)  is  therefore  in  &*  and 


Since  2)  is  dense  in  £>  it  follows  that 


Hence  ^(t)  is  a  solution  of  (1). 
THEOREM  II.  If  H  is  a  self-adjoint  operator,  the  equations 


^  (1A) 

have  a  unique  solution  for  every  <f>  in  %  and  for  all  t. 


B2 
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Since  H*  —  H>  a  solution  of  the  weak  form  of  ( 1  A)  must  exist  by  theorem  I. 
We  shall  show  that  a  solution  of  the  weak  form  of  (1  A)  is  a  solution  of  the 
strong  form,  and  that  the  solution  is  unique:  it  is  of  some  importance  to 
notice  that  the  assumption  that  H  is  self-adjoint  is  used  only  to  prove  the 
existence  of  a  solution  of  the  weak  form  of  ( 1  A),  and  is  not  made  use  of  in 
the  subsequent  stages  of  this  proof. 

It  was  shown  in  the  proof  of  theorem  I  that 


to 

hence  by  lemma  B  the  solution  of  the  weak  form  of  (1  A)  is  a  solution  of 
(1A)  almost  everywhere.  The  points  at  which  (1  A)  is  satisfied  are  all  those 
at  which  [ft(t  +  h)  —  i/r(t)]/h  tends  strongly  to  a  limit,  that  is,  those  and  only 
those  points  at  which 

(tJL.  L\  —  ifr{t\         i/r/*_L  Ji'\__,/r/*\  I! 

^0 


h  h' 

as  h,  &'-*0  independently.  The  element  inside  the  modulus  sign  is  itself  a 
solution  of  (1  A)  (with  a  different  initial  value),  and  so  by  lemma  1  the  value 
of  the  modulus  depends  only  on  h  and  h',  not  on  t.  If  the  differential  coeffi- 
cient exists  in  the  strong  sense  for  one  t,  it  must  therefore  do  so  for  all  t: 
hence,  since  it  exists  for  almost  all  t,  it  exists  for  all  t,  so  that  (  1  A)  is  satisfied 
everywhere  in  the  strong  sense. 

To  prove  the  uniqueness  of  the  solution,  let  ifr^t)  and  ^2(0  ^e  ^wo  solu- 
tions of  (1A).  Then  ^fz(t)  =  ^i(t)  —  ^s(0  is  a  solution  of  the  differential 
equation  in  (1  A)  with  the  initial  value  ^8(0)  =  0.  By  lemma  1,  ||  \/r9(t)  \\  is 
constant,  hence  ^3(£)  =  0  for  all  t,  and  so  the  solution  is  unique. 

In  theorem  I  it  was  only  possible  to  prove  that  there  exists  a  subsequence 
frp(t)  of  the  elements 


which  tends  weakly  to  i/r(t).  If  H  is  self-adjoint,  it  can  be  shown  that  the 
sequence  vMO  (&  ~  1»  2,  3,  ...)  tends  strongly  to  i/r(t). 

In  the  first  place,  if  {p}  is  a  subsequence  of  {N}  for  which  tffr(t)  tends 
weakly  to  \jr(t),  then  since  ||  i/rN(t)  ||  =  ||  fr(t)  \\  for  all  N, 


2  ~ 


because  of  weak  convergence.   Thus  any  subsequence  of  the  ^N(t)  which 
tends  weakly  to  fr(t)  tends  strongly  to  that  limit. 
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If,  now,  $y(£)  does  not  tend  strongly  to  ijr(t),  there  is  a  number  e>0, 
and  a  subsequence  {r},  r->oo,  of  {N},  such  that 


for  all  r.  As  in  the  proof  of  theorem  I,  it  follows  that  we  can  choose  a  sub- 
sequence {s}  of  the  sequence  {r}  for  which  i/rs(t)  tends  weakly  to  a  solution 
of  (1A).  Since  ft(t)  is  the  only  solution  of  (1  A),  i/rs(t)  tends  weakly  to  fr(t): 
hence  i/ra(t)  must  tend  strongly  to  fr(t),  and  so  the  inequality  above  cannot 
hold  for  sufficiently  large  values  of  s.  It  follows  that  there  can  be  no  sub- 
sequence of  {N}  with  the  properties  required  of  the  subsequence  {r},  and 
therefore  that  ftN(t)  tends  strongly  to  ^f(t). 

This  gives  us  a  new  definition  of  the  operator  eitH  for  a  self-adjoint 
operator: 

V  y^n  i8  a  sequence  of  subspaces  of  §,  such  that  H  is  bounded  in  *$lnfor 
each  n,  such  that  90£n  C  9ftTO+1  for  all  n,  and  such  that  the  least  closed  linear 
manifold  which  contains  all  the  ^Jln  is  Jp,  and  if  En  is  the  projection  operator 
corresponding  to  Wln)  then  for  every  t  and  <f>  the  elements 


r=0    F! 

tend  strongly  to  an  element  i/r(t)  as  n-^co:  we  can  write 


||  \jf(t)  ||  =  ||  0  ||  ,  and  eriiH  is  the  inverse  of  eiiH,  so  that  eitH  is  a  unitary  operator. 

It  has  been  shown  that  subspaces  with  the  properties  required  of  the  30fcn 
exist,  and  the  statements  concerning  the  elements  ftn(t)  have  been  proved 
for  elements  <f>  in  $).  The  extension  to  any  element  ^  in  §  comes  immediately 
by  considering  a  sequence  <pr  of  elements  of  %)  which  tends  to  <fi.  eitu(f>  may 
exist  even  if  H<f>  and  Hi/r(t)  do  not. 

In  general,  whenever  (1A)  has  a  solution,  that  solution  is  unique.  For 
operators  which  are  not  self  -adjoint,  no  solution  of  (1A)  may  exist,  and 
while  there  must  be  at  least  one  solution  of  (1),  this  solution  is  not  unique, 
in  general. 

If  \lr(t)  is  a  solution  of  (1)  and  H  is  not  self-adjoint, 
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Hence  ||  \Ir(t)  ||2  is  not  necessarily  constant:  it  will  decrease  for  increasing 
t  if$m(H*i/rt  i/r)  is  positive  (Le.  if  ^  is  in  the  (£+  class  of  H)  and  will  increase 
if  tyn(H*i/r,  fr)  is  negative  (i.e.  if  i/r  is  in  the  (£~  class  of  H).  Thus  the  basis 
of  the  proof  of  uniqueness  falls  away.f 

4.    The  structure  of  maximal  operators 

The  properties  of  maximal  operators,  for  which  only  one  of  the  (£+  or 
(£~  classes  exists,  will  be  considered  in  detail.  It  will  be  assumed  in  what 
follows  that  only  the  (£~  class  exists  for  the  operator  H.  Operators  for  which 
the  (£+  class  alone  exists  can  be  brought  within  the  scope  of  the  reasoning 
by  considering  their  negative,  or  by  considering  values  of  t  of  opposite  sign 
to  those  considered  in  the  sequel. 

If  only  the  (£~  class  exists,  then  from  (6) 

(7) 


the  sign  of  equality  holding  if  and  only  if  ^  is  in  2).    Hence 
increases  with  t  .  The  argument  of  theorem  II  can  therefore  be  used  to  show 
that  the  solution  of  (1)  is  unique  for  t  <  0. 

In  the  proof  of  theorem  I,  a  solution  of  (1)  was  constructed  as  the  weak 
limit  of  a  sequence  {$p(t)},  which  satisfied  ||  ^fp(t)  ||  =  <f>  for  all^p;  this  implies 
that  the  solution  \/r(t)  satisfies 

IfWIMmi2-  (8) 

There  is  therefore  one  solution  of  (1)  for  t>0  (as  well  as  for  £<0)  which 
satisfies  (8).  From  (7),  ||  ft(t)  \\  cannot  decrease  as  t  increases,  hence  it  must 
remain  constant  for  £  ^  0.  It  then  follows  from  (6)  that  this  \jf(t)  is  in  1)  for 
all  ty  and  is  therefore  a  solution  of  the  weak  form  of  (1  A),  The  argument 
of  theorem  II  then  applies  to  show  that  this  solution  is  a  solution  of  the 
strong  form  of  (1  A),  and  that  it  is  a  unique  solution.  We  therefore  get  the 
following  : 

THEOREM  III.  If  H  is  a  maximal  operator  with  empty  (£+  class,  a  unique 
solution  of  (I  A)  exists  for  all  t^Q  for  all  0  in  3).  A  unique  solution  of  (I) 
exists  for  all  t<Q  and  all  (f>  in  ®. 

The  operator  which  transforms  an  element  <f>  of  $>  into  the  solution  \jr(t) 
of  (1  A)  for  t^  0  is  a  bounded,  in  fact  an  isometric,  operator.  Its  domain  can 
therefore  be  extended  to  include  all  ^>,  since  $)  is  dense  everywhere  in  §. 


t  See  Stone,  ref.  (2),  Chap,  ix,  particularly  pp.  343-344.  The  sign  of 
is  used  as  the  means  of  defining  the  (£+  and  (£~  classes,  in  this  paper,  so  no  reference 
to  the  resolvent  operator  is  involved. 
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This  operator  will  be  denoted  by  U(t).  Similarly,  the  operator  which  trans- 
forms <j>  in  $>  into  the  solution  of  (1)  for  t  <  0  is  bounded,  and  so  its  closed 
extension  includes  all  Jp  in  its  range.  This  closed  operator  will  be  denoted 
by  F(t)(«0). 

The  arguments  used  above  in  defining  eiiH  for  self-adjoint  H  all  apply 
here  equally  for  £  £?  0,  and  we  conclude  that  for  every  <f>  in  §  the  elements 
eitEnHEn^  (t^O)  tend  strongly  to  U(t)(f>. 

For  t  <  0,  it  is  obvious  that  the  sequence  frp(t)  will  converge  only  weakly 
to  i/r(t)  (except  for  those  elements  for  which  it  happens  that  ||  ijr(t)  ||  =  ||  <J>  \\ 
for  a  particular  t  <  0).  However,  we  can  prove  that  the  sequence  ftN(t)  as 
a  whole  converges  weakly  to  fi(t).  For,  if  this  is  not  the  case,  there  exists 
an  element  g,  an  e  >  0,  and  a  subsequence  {r}  of  the  integers  such  that  for 
every  r 

(Tjf(t)-i/rr(t),g)>e. 

From  {r}  a  subsequence  {s}  with  i]ss(t)  converging  weakly  to  a  solution  of 
(1)  can  be  chosen  as  in  the  proof  of  theorem  I.  Since  ijf(t)  is  the  unique 
solution  of  (  1  )  ,  i/r8(t)  must  converge  weakly  to  i/f(t),  and  therefore  no  sequence 
with  the  properties  required  of  {r}  can  exist. 

For  any  <j>  in  §  and  t<0,  write  V(t)<j)  =  ^(t).  Choose  a  sequence  {<f>{} 
from  *$),  0t-->0,  and  write  V(t)<f>i  =  ft^t), 


Then 


The  first  and  third  terms  on  the  right  are  each  less  than  ||  <j>  —  ^>i  ||  ||  g  ||  ,  and 
so  can  be  made  less  than  \e  by  taking  i  sufficiently  large.  The  second  terms 
tend  to  zero  as  n-^ao  because  of  the  weak  convergence  of  the  ^i>n(t)  to  fti(t); 
having  chosen  i  we  can  choose  n  to  make  this  term  less  than  ^e.  Hence  there 
exists  ^0(e)  such  that  for  n  >  n0(e) 


so  that  ^Qin(t}  is  weakly  convergent  to  i 
We  get  then: 

If  H  is  a  maximal  operator,  with  empty  (£+  class,  and  9Jlw,  En  are  defined  as 
in  theorem  /,  then,  for  every  <f>  in  $$,  eiiEnHEn<j)-+  U(t)<j)  strongly,  for  t^Q, 
yy)^  weakly,  for  ^0.  We  could  therefore  write  U(t)  =  eitn  for 
V(t)  =  eiiHfort^Q. 
It  is  clear  that  for  all  positve  t  and  T 
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From  lemma  1  (U(t)f,  U(t  +  r)g)  is  independent  of  t  for 
From  lemma  D,  if  t  <  0,  £-f  r  ^  0  and/  and  gr  are  in  2), 


=  0. 

By  considering  sequences  of  elements  in  5)  converging  to  arbitrary  elements 
in  §,  this  can  be  extended  to  any  elements  in  $.  Thus  we  have 

LEMMA  2.  For  any  elements  f  and  g  in  Jp,  (  U(t)f,  U(t  4-  T)  gr)  ^  independent 
oftfort^O,t  +  T^Oand(V(-t)f,  U  (-  1  +  T)  g)  is  independent  of  t  for  r^t^O. 
In  particular,  if  we  put  successively  £  =  0  and  t  =  r  we  get 


so  that  we  have  the  corollary: 

LEMMA  2A.  V(-t)f=  U*(t)f  for  every  fin  $,t>0. 
A  further  corollary  is 

LEMMA  2B.  //  1,  T  are  positive, 

V(-t)V(-T)=V(-t-T). 

It  is  required  to  prove  that  for  ^  in  § 


On  applying  lemma  2,  we  get 

(F(-*)F(-7)0>flr)  =  (V(-r)g,U(t)g) 
and  (F(-*-T)#,flr)  =  ( 

if  gr  is  any  element  in  $.  Hence  the  corollary. 

LEMMA  3.  H  and  U(t),  H*  and  V(  —  t)  commute  if  ^0;  more  precisely, 
if<f>  is  in  ®,  50  is  U(t)<f>  and 


if  <f>  is  in  <2)*,  so  is  V(  —  t)(f>  and 


For  ^  in  5D  the  left-hand  side  of  the  first  equation  exists  and 
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Since 


and  since  U(t)  is  a  continuous  closed  transformation, 


For  the  second  part,  let  0  be  in  ®*,  gr  in  %.  Then 


=  (H*<f>,U(t)g), 
on  applying  successively  lemma  2  A  and  the  first  part  of  lemma  3.  Hence 


so  that  F(-  00  is  in®*. 

We  can  therefore  put  in  the  equation  above 


and,  by  lemma  2  A,       (£T* 

and  the  equality  of  the  left-hand  sides  of  these  expressions  gives 


It  may  be  mentioned  that  17(0  does  not  transform  an  element  of  ®* 
which  is  not  in  $  into  an  element  of  $)*. 

LEMMA  4.  //^  is  a?w/  element  of  <3)*,  F(0  ^  *«  «ri  element  ofSt*  and  satisfies 
(I)  for  any  t^Q',  also 


Jn  particular,  for  t  —  0, 


50  ^Aai  a^f  —  0,  ||  F(00  1|  decreases  at  a  non-zero  rate  as  t  decreases,  unless  <j> 
is  in  ®. 

If  g  is  any  element  of  %  and  t  >  0, 


which  proves  that  (  1  )  is  satisfied.  The  rest  follows  immediately  from  lemma  D. 
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LEMMA  6.  (a)  If(j>  is  in  $),  then  att  =  0 


(b)  If<p  is  in%)*,  then  as  t->Q  through  positive  values, 


(c)  For  any  <j>in§,        \\  V(-t)<j>-<!>f  =  o(l), 

as  t  ->  0  through  positive  values. 

(d)  If<f)  is  in  %*,  V(t)<f>  (t  <  0)  satisfies  (I)  in  the  strong  sense  everywhere. 

(e)  7/0  is  in  $*,  then  att  =  Q 


~->- 

(d)  follows  from  the  fact  that  {U(t)(f>  —  <f>}/t  tends  strongly  to  iH<f)  as 
and  (b)  from  the  fact  that  {V(  —  t)<f>  —  (j>}lt  tends  weakly  to  —iH*<f>  as 
To  prove  (c),  let  {$n}  be  a  sequence  of  elements  of  *$)*  tending  to  <f>.  Then 


We  can  find  n  such  that  \\<f>  —  <j>n\\  ^Je,  and  then,  from  5(6),  t  such  that 
F(-<)^n-^I«-  Then 


It  was  proved  in  lemma  4  that  V(t)<f>  satisfies  (1)  in  the  weak  sense:  we 
have  for  all  t  >  0 

<P-  =  -iH*V(-t)<t>  =  -iV(- 


and  ||  V(  —  t)H*(p  \\  ^  ||  H*<p  \\  ;  thus  by  lemma  C  it  is  sufficient  to  show  that 
V(  —  t)H*(j)  is  a  continuous  function  of  t.  By  lemma  2  (6) 


=  0(1) 

from  lemma  5  (c). 

5  (e)  follows  immediately  from  5  (d),  since  the  latter  implies  that 

strongly. 


On  the  basis  of  these  theorems  we  can  determine  the  main  properties  of 
the  operators  U  and  F.  Both  have  §  as  domain.  The  range  of  U(t)  cannot 
determine  §,  for  if  it  did  U(t)  would  be  a  unitary  transformation,  and  the 
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inverse  transformation  {  £7  (£)}-*  would  have  £  as  domain.  If  that  were  so, 
the  function  [*7(-«)]-i0     (««>), 


would  give  a  solution  of  (1)  for  negative  t  and  any  <j>  of  %*  satisfying 
||  ft(t)  ||  =  ||  <j>  ||  .  Since  (1)  has  a  unique  solution  which  is  such  that  ||  ft(t)  \\ 
decreases  when  <f>  is  in  (£-  this  is  impossible. 

The  range  of  U(t)  is  a  closed  linear  manifold,  a  subset  of  $  which  will  be 
denoted  by  ${(t).  The  elements  U(t)<j>  for  <j>  in  %  form  a  linear  subspace  of 
3ft(£)  which  determines  3l($):  this  linear  subspace  will  be  designated  by  $>(*). 
The  projection  operator  corresponding  to  $l(t)  will  be  designated  by  E(t). 

It  is  easily  seen  that  for  any  0  (t^  T  ^  0) 

F(-T)*7(^=*7(*--T)</>.  (9) 

In  particular,  F(  -  *)  Z7(0  =  I.  (10) 

From  (9),  if  t&  T  >  0,       U(r)  V(  -  T)  Z7(«)  <f>  =  U(t)  <f>} 
and  in  particular  U(t)  V(  -  1)  U(t)  =  U(t)  ; 

so,  for  any  element  <j>  in  31(0, 

Z7(OF(-0#  =  0.  (11) 

As  [C/(0  F(  —  f)]  transforms  any  (f>  into  an  element  of  $l(t),  (11)  gives 
[E/«F(-<)]2=  U(t)V(-t). 

Moreover,  it  follows  from  lemma  2  A  that  U(t)  V(  —  t)  is  symmetric,  and  so 
[U(t)  V(  —  t)]  must  be  a  projection  operator.  f  Its  range  is  included  in  $l(t), 
and  as  it  transforms  any  element  of  $t(t)  into  itself,  its  range  mu^t  coincide 
with  m(t).  Hence  U(t)  V(_t)  ^  E(t)m  (12) 

Since  U(t)  is  isometric,      ||  E(t)$\\  =  ||  V(-t)<j>  ||.  (13) 

The  rate  of  change  of  ||  E(t)<p  ||  can  therefore  be  deduced  from  lemma  4. 
In  particular,  if  ^  is  not  in  %  but  is  in  (£~, 

!«*WI|2«>.  (H) 

Differential  independence 

The  following  definitions  will  be  introduced  here: 

Elements  flf  /2,  ...,/&  *w  *;£)*  are  5ai^  to  be  differentially  independent  with 
respect  to  U  if  there  exists  no  set  of  complex  numbers  aly...,ak  such  that,  if 

/=2 

then  ||  f-U(t)f  ||2  = 

This  will  be  written  /1}  /2,  ...,/fc  d.i.  (C7). 

t  Ref.  (2),  70,  def.  2-16  and  theorem  2-26. 
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Elements  /u/2,  ...}fk  in  S>*  are  said  to  be  differentially  independent  with 
respect  to  E(t)  at  t  if  there  are  no  complex  numbers  a^...^ak  such  that,  if 

/=£"*/*, 
then  ||  E(t)f-  E(t  +  &t)f  ||2  =  o(A£)     as    A*->0. 

This  will  be  written  f±,  /2,  .  .  .  ,  fk  d.i.  (E;  t). 

The  multiplicity  of  U  is  the  greatest  number  of  f  which  can  be  d.i.  (U). 

The  multiplicity  ofEatt  is  the  greatest  number  off  which  can  be  d.i.  (  E;  t). 

LEMMA  6.  The  multiplicity  of  E  att  =  0  is  equal  to  the  multiplicity  of  U. 
For  any/,          U(t)f-f=  (U(t)f-  E(t)f}  +  {E(t)f-f}, 

and  since  the  first  term  on  the  right  is  in  SR(£  )  and  the  second  in  the  manifold 
orthogonal  to  9ft(£)> 

II  U(t)f-f  U2  =  ||  U(t)f-E(t)f  ||2  +  ||  E(t)f-f  ||2. 

Hence,  if  ||  U(t)f-f  ||2  is  o(t),  so  is  ||  E(t)f-f  ||2.  The  multiplicity  of  U(t)  is 
therefore  not  less  than  that  of  E(t)  at  0.  To  prove  the  multiplicities  equal  it 
is  sufficient  to  show  that  the  first  term  on  the  right  is  o(t)  for  any  /in  X)*. 
The  term  in  brackets  is 

U(t)f-E(t)f=U(t)lf-V(-t)n, 
so  that  ||  t7(*)/-  E(t)f  ||2  =  ||  /-  V(  -  t)f  ||2  =  0(t*)  (15) 

from  lemma  5  (b),  for  any  /in  £)*. 

As  a  corollary,  we  see  that  elements  d.i.  (  U)  are  d.i.  (E]  0),  and  conversely. 

LEMMA  7.  The  multiplicity  of  E  at  any  positive  t  is  equal  to  that  at  0. 
Let  the  multiplicity  of  E  at  t  be  k,  and  let  flt  ...,/&  be  d.i.  (E;t).   Put 
/  =  Sar/r,  where  the  a's  are  arbitrary  constants: 

{E(t  )/-  E(t  +  A<)/}  =  U(t)  V(  -  t)f-  U(t  +  &t)V(-t-  &t)f 
=  17(1)  [V(  -t)-  U(M)  V(  -  At)  V(  -  t)]f 

from  (12).  Thus 


so  that  if  the  fr  are  d.i.  (E\  t)  the  elements  V(  —  t)fk  are  d.i.  (E;  0).   On  the 
other  hand,  if  the  fr  are  d.i.  (E',  0)  and/  =  Sar/r,  we  get 

[E(t)  -  E(t  +  A*)]  U(t)f  =  [U(t)  -  U(t  +  &t)V(-t-  A«)  E7(*)]/ 

(  -  M)  V(  -  1)  U(t)]f 


from  (11)  and  (12),  so  the  U(t)fr  are  d.i.  (E',t). 
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This  proves  lemma  7.  Taken  together  with  lemma  6,  it  proves  that  the 
multiplicity  of  E  at  any  t  is  equal  to  that  of  U.  In  future  we  shall  speak 
simply  of  the  multiplicity  of  E,  since  that  has  been  proved  independent  of  £. 

The  deficiency  index  of  H 

The  definitions  of  the  deficiency  index  of  a  symmetric  operator  given  in 
the  literature  depend  on  the  use  of  the  resolvent  operator.  As  we  are  not 
employing  the  theory  of  the  resolvent  operator  here,  it  is  as  well  to  give  an 
independent  definition  of  deficiency  index.  Since  we  are  concerned  only 
with  maximal  operators,  we  shall  consider  only  this  case:  it  is  possible  to 
extend  the  definition  to  general  symmetric  operators. 

The  deficiency  index  of  a  maximal  symmetric  operator  H  is  the  largest 
number  of  elements  of  <2)*  which  can  be  linearly  independent  (mod^),  i.e. 
which  are  such  that  no  element  linearly  dependent  on  them  belongs  to  fS). 

If  the  number  of  elements  linearly  independent  (mod^)  is  n  (finite  or 
countably  infinite),  the  index  is  written  (0,  n)  if  the  (£~  class  exists,  (n,  0)  if  the 
(£+  class  exists. 

It  is  evident  that  this  definition  is  equivalent  to  the  standard  definition, 
as  given  by  von  Neumann. 

It  will  now  be  proved  that  for  maximal  H  the  deficiency  index  of  H  is 
equal  to  the  multiplicity  of  E  or  U. 

If/i,  .  .  .  ,/fe  are  d.i.  (  U),  then  we  can  show  that/j,  .  .  .  ,fk  are  linearly  indepen- 
dent (mod®).  For,  if  they  are  not  linearly  independent  (mod®),  numbers 
ar  exist  such  that  £ar/r  is  in  $).  In  that  case 

\\f-U(t)f\\*~o(t) 

by  lemma  5  (a),  so  that  the/r  are  not  d.i.  (U). 
On  the  other  hand,  with/  as  above, 


so  that,  at  t  —  0,  using  lemma  4  and  (13), 


/,/), 
and  this  is  zero  if/  is  in  3).  Thus 

\\f-E(t)f\\*  =  o(t) 
if  and  only  if/  is  in  ®  .  Hence  the  following  theorem. 
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THEOREM  IV.  The  multiplicity  of  E  ia  equal  to  the  deficiency  index  of  H. 
Elements  linearly  independent  (mod  $)  )  are  d.i.  (E\  0)  and  conversely. 

Let  the  deficiency  index  of  H  be  (0,  n)  and  let  /x,  .  .  .  ,/n  be  a  set  (or,  if  n  is 
infinite,  a  sequence)  of  elements  of  $)*  linearly  independent  (mod  2))  and 
such  that  at  £  =  0 

-  jt  (E(t)f»  E  «/«)  =  *«  =  0     (*  *  0, 

=  1     (fc  =  Z).  (16) 

Such  elements  can  be  found  by  a  process  similar  to  the  Gram-Schmidt 
orthogonalization  process,  given  any  set  of  elements  linearly  independent 
(mod  £)).  Any  /  in  ®*  can  be  put  in  the  form 


where  g  is  in  <£).  It  is  easy  to  prove  from  (12)  and  lemma  4  that  at  t  =  0 


so  that  \\f-E(M)f\\*  =  M?,\ak\'*  +  o(t),  (17) 

-SKI*.  (17') 


The  expression  of  an  arbitrary  element  in  terms  of  the  operators  E(t)  and  U(t) 

The  projection  operators  E(t)  correspond  to  manifolds  ${(t)  such  that 
9l(029ft(<')  ift<t'.  As  t-+ao,  $t(t)  tends  to  a  limit,  say  SR(oo),  the  space  of 
all  elements  contained  in  every  $l(t).  E(t)  tends  to  the  corresponding  E(co). 
$t(oo)  may  be  empty,  or  a  Euclidean  space  of  a  finite  number  of  dimensions, 
or  a  Hilbert  space.  Since 

E(t)E(t')  =  E(t),     if    t>t', 
E(t)  [E(v)  -  E(u}]  =  0,          if    t  >  u  >  v, 


so  that  9t(£)  is  orthogonal  to  [9ft(v)O9tM]  if  t>u>v.   9ft(oo)  is  therefore 
orthogonal  to  [SR(v)©8t(w)]  for  any  u  and  any  finite  v,  v  <  u. 

$l(u)  is  the  space  of  all  elements  U(u)(f>,  so  that  U(t)  transforms  $l(u)  into 
8t(^-M).  Let0  be  an  element  of  [9*M0SK(w)]  (Q<v<u).  U(t)  transforms  0 
into  an  element  U(t)  <f>  of  $l(v  +  1).  (f>  is  orthogonal  to  SR(^),  so  that  if  g  is  any 
element  of  9*(w),  ($,$)  =  0.  Hence  (U(t)(j>,  U(t)g)  =  0.  Any  element  of 
^(M-f  t)  can  be  expressed  in  the  form  U(t)g  with  g  in  9ft(^);  hence  U(t)<f>  is 
orthogonal  to  3l(u  +  t).  It  follows  that  U(t)  transforms  [9i(v)00t(w)]  into 
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If  t  <  u,  V(  —  t)  is  isometric  in  0t(^)  and  transforms  it  into  9fit(w  —  t)  (see  (9)). 
Hence  U(t)  and  V(-t)  transform  <ft(oo)  into  itself,  and,  in  SR(oo),  V(-t)  is 
isometric  and  is  the  inverse  of  U(t),  so  that  U(t)  is  isometric  and  has  an 
isometric  inverse  in  the  space  SR(oo).  U(t)  must  therefore  be  unitary  in  $l(oo). 

For  any  <j>  of  ®  lying  in  $t(oo),  the  equation  (1) 


has  the  solution  ^r(t)  =  U(t)(f>  for  t^Q  and  ^r(t)  =  F(£)0  for  t^O.  For  all  e, 
II  ^W  II  =  0-  It  follows  from  (6)  that  \]f  is  in  3),  so  that  the  solution  is  a  solu- 
tion of  (  1  A).  Hence  (  1  A)  has  a  solution  for  alii  (  —  oo  <  t  <  oo)  for  all  elements 
of  5D  in  fft(oo),  and  this  solution  lies  in  fft(oo).  Consequently,  9ft(oo)  reduces  H, 
and  H  is  a  self-adjoint  operator  in  9ft  (oo);  this  is  a  consequence  of  theorem  III 
ofref.  (10). 

Any/  can  be  written 

)f,  (18) 


and  so  we  are  led  to  analyse  the  operators  E(t).  By  (18),  every  element  of 
the  space  £>09R(oo)  is  expressed  as  a  sum  (or  integral)  of  elements  of  the 
spaces  [%t(n8)Q${,(n  4-  1  #)].  We  have  also  seen  that  for  any  8  >  0  the  operators 


U(n&),  n  =  1,2,  ...,  transform  §09R(#)  into  the  spaces  [$t(nd)Q$t(n+  18)]. 
This  suggests  that  any  element  of  $09ft(oo)  can  be  expressed  as  a  sum  or 
integral  of  elements  U(nS)  [I  —  E(8)]f.  We  shall  give  this  idea  a  precise 
meaning,  and  deduce  an  analytic  form  for  the  expression  of  any  element 
of  §,  which  has  interesting  applications  and  throws  considerable  light  on 
the  structure  of  the  operators  H  and  U(t). 

Let  /jj/2,  ...,/£,  ...  be  a  set  or  sequence  of  elements  d.i.  (U).  We  shall 
suppose  these  to  be  normalized  so  that  (16)  is  satisfied.  With  each  element 
fk  we  associate  an  integral,  defined  as  follows. 

Let  (a,  b)  be  any  finite  interval,  p(t)  a  continuous  function  defined  in  it. 
Let  (a,  b)  be  divided  into  subintervals,  a  =  tQ<tl<  ...  <tn  =  b,  with 
max  (tr+l  —  tr)  =  3.  Write  (£r+1  —  tr)  —  A£r,  and  consider  the  sum 


The  contribution  to  this  sum  from  the  interval  (tr,  tr+l)  lies  in  the  space 
9l(^r+1)]  and  so  is  orthogonal  to  the  contributions  from  the  other 
intervals.  We  have,  therefore, 
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In  order  to  show  that  S  tends  to  a  limit  as  8  tends  to  0,  let  us  consider 
the  effect  of  inserting  subdivisions  in  the  intervals.  Let  (tr,  £r+1)  be  sub- 
divided by  divisions  tr  =  tfQ<t'^<  ...  <t'm  —  £r+1,  and  write  (^+i~^)  =  AtfJ. 
The  contribution  to  the  new  sum,  $',  from  the  interval  (trt  tr+l)  is 


i-o 
whereas  the  contribution  to  S  was 


(  -  tr)  -  E(tM  -  tr)]fk. 

i~0 

The  contributions  to  each  of  these  sums  from  the  interval  (t^  t'i+l)  lie  in 
[$l(£i)G>^(^+i)]  and  so  are  orthogonal  to  the  contributions  from  any  oth*^ 
interval.  Writing  these  contributions  Sir  and  S'ir,  we  have 

\\Sr-  SW=m£\\Sir-Sl\\\ 

where 


S'ir  -  8tr  =  p(t't)  U(t()  [I  -  E(btW  fk  -p(tr)  U(tr)  [E(t't  -  tr)  -  E(t'i+1  -  tr)]  fk 
i)  -P(tr)]  U(t't)  [I  -  *(A«J)]  h 

4-  P(tr)  {U(t't)  [I  -  fl(AtJ)]  -  U(tr)  [E(tl  -  tr)  -  B(t'i+l  -  tr)]}  fk 
(tt)-p(tr)']^i+p(tr)o-2, 
say.  Next,  writing  (t't  —  tr)  =  uf  , 


£  V(-Ui)}fk 

-F<  -«,)]/*. 
For  any  element  ^  of  ®*,  we  have,  from  lemma  3,  lemma  4  and  (13), 


<  2  | 
so  that  on  integrating  between  0  and  A£j  we  get 


<  2  1|  ^*[7-  F(  - 
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and  by  lemma  5  (c)  ||[I-  V(-  ut)]4  \\  =  o(l), 

as  i^-^O,  hence  as  A(£r)->0,  so  that   ||  crz  ||2  =  o(A^)  uniformly  in  i   as 
(Wi  *~  *r)  -*"  0-  As  f°r  tne  other  terms  in  Sir  —  S'ir, 


by  (17),  and  |  ^(^)—  p(^r)  |  ->0  as  (£r+1  —  £r)  ->  0.  Hence,  altogether, 

||Slr-^,P  =  o(Aa 
and  therefore  ||  8r  -  S'r  ||  a  =  o(S  A«0 


as  (^r+i  —  ^r)-^-0.    It  follows,   by  the  usual  arguments,  that  8  converges 
strongly  to  a  limit  as  the  maximum  length  of  the  intervals  (£r+1  —  tr)  tends 
to  zero.  The  limit  of  S  is  the  integral  we  wish  to  define.  It  has  a  certain 
resemblance  to  the  invariant  integral  for  a  group. 
The  limit  of  S  will  be  written  in  the  forms 


Fp(t)d(Uik;t),       Fp(t)d(U;fk;t), 

J  a  J  a 


to  indicate  the  dependence  on  U,  p(t)  and/fc.  In  the  sequel,  when  the/fc  is 
obvious  and  no  confusion  will  result,  we  shall  write  (U\t)  instead  of  (C/;  k\  t). 


rb 
It  is  easy  to  see  that       Mm  ||  S  ||2  =       |  p(t)  \*dt, 

J  a 

=  Pp(t)d(U;  k;  t),     &  =  \bq(t)d(U\  k't  t), 

J  a  J  a 

W=  r\ 

J  a 

(^^)=  r 

J  a 


and  hence  that  if 


then 


and 

The  extension  of  the  integral  to  a  general  function  of  L2(a,b)  is  im- 
mediate. Choose  a  sequence  of  functions  pn(t)  converging  in  mean  square 
to  any  p(t)  of  L2(a,  b).  Then  if 


4>n=  r 

J  a 

we  have  l^-^.ll1  -  fl  Pm~Pn\zdt, 

J  a 
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which  tends  to  zero  as  m,  n  tend  independently  to  infinity,  and  hence  <j>n 
converges  strongly  to  an  element  <f>.  This  element  <j>  is  the  meaning  we 

Jb 
p(t)  d(U\k\t).  The  restriction  that  (a,  b)  be  finite  can  be  removed 
a. 
in  the  same  way.  If  p(t)  is  any  function  of  J02(0,  oo),  and 


=  ( 
Jo 

then  \\<t>(T')-<t>(T)\\*=  f*\p(t)\*dt, 

J  T 

and  since  this  tends  to  zero  as  T,  T'->oo  independently,  <j)(T)  converges 
strongly  to  a  limit  as  T~*ao.  Hence  for  every  p(t)  of  L2(0,  oo),  the  integral 


o 
defines  a  definite  element  <j>  of  §,  such  that 


Consider  now  two  elements 

^  =  [bp(t)d(U\  k\  t),     ty  -  {bq(t)d(U\  l\  t). 

J  a  J  a 

Suppose  for  the  moment  that  p(t)  and  q(t)  are  continuous  and  a  and  6  finite. 
Then  (^,  ft)  is  the  limit  of 

fk,  U(tr)  [I-E(&tr}]f,) 


fc)/;)  -  (E(Mr)fk, 
By  (16),  the  expression  in  brackets  is 

AJr  +  o(AZr)     if    k  =  l, 
o(ktr)         if    k^l, 
and  it  follows  that  (0,  ^-)  =  0     (fc  =M). 

The  extension  to  general  p  and  g  of  £2(0,  oo)  is  immediate.  Hence,  if 
p(t)  and  g(^)  are  any  elements  of  L2(0,  oo),  and 


then  (0,  ^)  =       jpWg{0*     if 

Jo 


0  if    fc=M.  (19) 
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The  next  step  is  to  show  that  any  element  of  <2)*  can  be  represented  in 
terms  of  these  integrals.  The  integrand  is  suggested  by  (18).  We  have,  for 
any  t  >  0  and  for  any  /  in  <;£>*,  from  (12) 


=  U(t)[I-E(j\t)]V(-t)f.  (20) 

Since  V(  —  t)f  is  in  "2)*,  we  can  put 

F(-*)/=ScA(*)A  +  £(0,  (21) 

where  i(t)  is  in  "3).  Then 


=  \\[I-E(M)]V(-t)/\\* 


from  (17),  so  that  -~  ||  E(t)f  ||2  =  S  |  ck(t)  |2.  (22) 

On  integrating,         ||  /  ||2-  ||  E(co)f  ||2  =  S  f  *  c/c(£)  |2cfc,  (22a) 

Jo 

f  °° 

so  that  ck(t)  must  be  L2(0,  oo)  for  all  k,  and  21  1     |  c/c(0  |2^  must  be  finite. 

k  Jo 

These  facts  will  be  expressed  in  future  by  the  phrase:  the  set  {ck(t)}  is  of 
integrable  square.  This  phrase  implies  that  the  sum  of  the  integrals  (or  if 
they  are  infinite  in  number,  for  an  operator  of  infinite  deficiency  index,  the 
limit  of  the  sum)  is  finite. 

Let  functions  {(/&(£)},  ?}(t)  be  derived  in  the  same  way  for  an  arbitrary 
element  g  in  <2)*.  From  (22), 


for  any  complex  number  e,  and  from  this  we  derive 

~(E(t)f,  E(t)g)  = 
at 

If  now  <f>  is  the  element  of  §  defined  by 


we  shaU  have  E(t)  $  =  E  f  °°cfc(A)  c?(C7;  *?;  A),  (24) 

*  J« 


because  the  part  of  the  integral  for  </>  over  the  interval  (0,  t)  lies  in 
and  that  over  (t,  oo)  in  $l(t).  Any  element  (f>  is  the  strong  limit  of  elements 

02 
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defined  by  equations  of  the  same  form  for  which  the  cfe(A)  are  continuous 
and  vanish  outside  a  finite  interval  of  A  and  for  all  but  a  finite  number  of  k. 
There  will  therefore  be  no  loss  of  generality  in  the  following  argument  if 
we  assume  that  the  cA;(A)  do  have  these  properties:  the  extension  to  the 
general  case  is  immediate.  Under  these  conditions  (E(t)<fi,E(t)g)  is  the 
limit  of  the  sum 


where  0  =  A0  <  Ax  <  .  .  .  <  An  =  A  is  a  subdivision  of  the  interval  (0,  A) 
outside  which  the  ck(A)  =  0,  and  the  maximum  length  of  the  intervals 
tends  to  zero.  From  (20)  the  sum  is 

S  "iWw-JBtAA^A,  [/-^(AA,)]  V(-^)g) 

k  i-0 


(from  (21)) 


k  i-0 

from  (16)  and  (17).  Hence 

(E(t)<l>,E(t)g)  =  \ 

and  therefore  - -=-  (E(t)  </>,  E(t)  g) 

On  comparing  this  with  (23)  we  get 


so  that,  on  integrating, 


This  has  been  established  for  any  element  g  of  ®*,  and  hence  <j>  and/  duTer 
only  in  their  projections  on  9ft  (oo).  This  projection  is  obviously  zero  for  <f>. 
We  therefore  get  the  important  result  that  any  element  of  <£)*  can  be 
expressed  in  the  form 

(25) 


/==  2  rck(t)d(U;  k;  t)  +  E(<x>)f. 

k  JO 


where  the  coefficients  ck(t)  form  a  set  of  integrable  square,  and  are  deter- 
mined by  (21). 
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Any  element  <j>  of  §  is  the  strong  limit  of  a  sequence  {/„}  of  elements  of 
S)*,  and  by  (25)  each/,,  can  be  expressed  in  the  form 

/„  =  S  r^k(t)d(U;  k;  t)  +  E(ao)f» 

k  J  0 

where  for  each  v  the  set  {cj.}  is  of  integrable  square.  Then 


Since  /„  is  strongly  convergent,  this  expression  tends  to  zero  as  /*,  v  tend  to 
infinity  independently.  It  follows  that  for  each  k  the  sequence  cvk(t)  is  con- 
vergent in  mean  square  as  v->ao.  Ifck(t)  is  the  limit  in  mean,  it  is  clear  that 
the  set  (ck(t)}  is  of  integrable  square,  and,  if  U  has  finite  multiplicity,  that 


v->ao   k  JO 

This  result  is  not  quite  obvious  if  the  number  of  k  is  infinite.  In  that  case, 
for  a  given  e,  we  find  y0(e)  such  that  ||  /^  —  /„  ||2<  e  for  all  /£,  v>  v0,  and  then 
for  a  fixed  v  we  can  find  kQ  such  that 


For  any  fi>vQ 

I          f°°l    *      12     \*     /   -  f00 

U>fc0j0  j          I  fc  Jo 

From  this  we  deduce  that 


and  the  required  result  follows  easily. 

If,  then,  ^  =  S  Pc^O^ZJ;  &;  t)  +  E(ao)<{>, 

k  JO 

the  sequence  /„  converges  strongly  to  ^,  and  therefore  <f>  =  i/r. 

It  follows  that  every  element  of  §  can  be  expressed  in  the  form  (25) 
with  some  set  of  integrable  square.  The  converse,  that  to  every  set  of  in- 
tegrable square  corresponds  an  element  of  §,  was  proved  earlier.  There  is 
a  one-one  correspondence  between  elements  of  §09t(oo)  and  the  sets  of 
integrable  square,  provided,  of  course,  that  we  treat  functions  of  jL2(0,oo) 
as  identical  when  they  differ  only  in  a  set  of  measure  zero.  From  (25), 


U(t)f  =  S  f 

k  J  0 


S  f 

A:  Jo 
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Iff  is  in  3),  this  is  differentiable.  Leaving  out  the  projections  on  01  (oo), 
which  are  differentiable  by  themselves,  we  get 


-/=  S 


r 

JO 


i  A). 


Here  we  take  ck(X)  =  0  if  A  <  0;  we  shall  adopt  the  same  convention  in  future 
with  functions  of  I/2(0,oo).  Iff  is  in  <£>,  {U(t)f-f}/t  tends  strongly  to  iHf 
as  t  ->  0  ;  hence  there  must  exist  a  set  (0^(A)}  of  integrable  square  such  that 


dX  =  0. 


(26) 


The  existence  of  a  set  {<//c(A)}  satisfying  (26)  is  a  necessary  condition  for/ to 
be  in  ®.  We  consider  what  sets  of  functions  {cfc(A)}  satisfy  the  condition  (26). 
It  is  definitely  satisfied  by  sets  whore  the  ck(X)  have  the  form 


=  fA 
Jo 


where  the  {ck(A)}  are  of  integrable  square  and  we  take  gk(X)  =  c'k(A.),  for  in 
this  case  the  expression  in  (26)  becomes 


if 

;Jo 


\-t 


(\ 

Jo 


«-iS  I"" 

k  J  0 


Since 


I  ^(A-r)- 

is  of  integrable  square,  we  can  find  K  such  that 
S    r 

JoKj  0 


for  all  r,  and  the  contribution  to  the  sum  above  for  k  >  K  is  then  less  than  e. 
For  any  particular  k,      ^m 

l^fA-Tj-g^AJl^A-^O 
Jo 

as  T->0.f  Hence  we  can  find  r(k,  e)  such  that  this  expression  is  less  than 
e/K  for  r  <  r(k,  e).  Then  if  r(e)  =  min  r(k,  e),  we  get  for  t  <  r(e) 


t  See  Wiener,  The  Fourier  Integral  (Cambridge,  1933),  24. 


1944]  SYMMETRIC  OPERATORS  IN  HILBERT  SPACE  39 


and  then 


/*«>  fA 

t~*  \       ff 

jo         J  \-t 


dX  <  2e, 


so  this  expression  tends  to  zero  as  £->0. 

Now  let  {ck(X)}  be  any  set  satisfying  (26).  Put 


=  r 

JO 


O 

From  the  result  just  used,  and  Minkowski's  inequality 


poo 

we  see  that  lim  t~*  £  '      I  ^(^  -  *)  -  VkW  1  2  dA  =  0. 

i-M)          fc  JO 

Let  r(A),  s(A)  be  functions  of  L2(Q,  oo)  such  that 


r(A)  =  f 
Jo 


For  any  particular  k  we  must  have 

lirn*-1  f°°r(A)fe(A-0-^(A)}^A  =  0. 
Jo 

The  left-hand  side  is 

lim  f°  r^±i).T!W&(A)dA  =  f  " 

f-^-oJo  f  Jo 

This  is  true  for  every  function  s  (A)  for  which  the  corresponding  r(A)  is  in 
Z/2(0,  oo)  and  so  qk(A.)  =  0  almost  everywhere.  It  follows  that  the  necessary 
and  sufficient  conditions  that  a  set  (c/c(A)}  satisfy  (26)  are: 
(i)  cfc(0)  -  0,  for  all  k; 

(ii)  Cfc(A)  is  almost  everywhere  equal  to  the  integral  of  a  function  of  L2(Q,  oo), 
cjfc(A),  and  the  sets  {c^.(A)}  are  of  integrable  square. 

A  set  of  functions  satisfying  condition  (ii)  will  be  called  a  set  differentiable 
in  mean  square. 

The  conditions  (i)  and  (ii)  are  necessary  in  order  that  an  element  defined 
by  (25)  should  be  in  <%),  so  the  domain  of  H  is  included  in  the  set  of  elements 
generated  by  functions  satisfying  (i)  and  (ii).  For  an  element  in  $>  given  by 
(25)  we  have 

\  k;  A)+£L0(oo)/.  (27) 


Actually,  the  domain  of  H  is  identical  with  the  set  of  elements  generated  by 
functions  satisfying  (i)  and  (ii):  for  consider  the  operator  —  H  say  —  whose 
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domain  is  the  entire  set  of  such  elements  and  whose  value  is  given  by  (27). 
H  is  a  closed  symmetric  operator,  and  an  extension  of  H.  Since  H  is  maximal 
it  has  no  true  symmetric  extension,  hence  H^H. 

The  necessary  and  sufficient  condition  that  /  should  be  in  $)*  is,  con- 
sequently, that  E(co)f  be  in  $)  and  that  for  some  K 


r^)9lWdt    <x(x  r\gk(t)  \2dtY, 

Jo  \  k  Jo  / 


(28) 

for  every  set  {gk(t)}  of  integrable  square  satisfying  (i)  and  (ii).  It  is  easy  to 
see  that  a  sufficient  condition  for  (28)  is  that  the  set  {ck(t)}  be  differentiable 
in  mean  square:  and  it  will  now  be  shown  that  the  condition  is  necessary. 

The  set  of  all  sets  of  functions  of  integrable  square,  {gk(t)}}  forms  a  Hilbert 
space  IS  if  we  define  the  scalar  product  by 


It  follows  from  (28)  that 


is  a  bounded  linear  functional  in  IS,  with  norm  K.  L  is  denned  in  the  first 
place  only  for  {gk}  which  satisfy  (i)  and  (ii),  but  these  are  everywhere  dense 
in  the  space  and  so  L  may  be  considered  defined  for  all  {gk}.  By  the  standard 
theorem  on  the  form  of  a  linear  functional  in  a  Hilbert  space,  there  must  be 
an  element  {yk}  in  IS  such  that  tor  all  {gk}, 


k  Jo 

On  equating  this  to  the  original  expression  for  L  it  follows  that,  for  almost 


so  that  (ck(t)}  is  differentiate  in  mean  square. 


t  The  necessity  of  this  condition  can  be  derived  directly  from  lemma  5  (e).  Omitting 
projections  on  9?(oo),  we  have 

/»  oo 

V(  —  t)f—^\     ck(^  +  t)d(U;k;  &), 
k  J  0 

foo 

so  that  if  -ff*/  =  *  S       #fc(A)  d(  U;  k;  A), 

k  J  o 

,.     v-  f°°  Cfc(A  +  «)-cfc(A)  .    2  . 

then  hmS          — — gk(\)    dA  =  0. 

t~>o  k  J  o  * 

An  argument  similar  to  that  carried  out  above  for  elements  in  !t)  shows  that  this  is 
equivalent  to  condition  (ii). 
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If /is  given  by  (25)  with  the  (cfc(A)}  a  set  of  integrable  square  and  differen- 
tiable  in  mean  square,  #*/is  given  by  a  formula  similar  to  (27): 


=  *Z  rckWd(U;  k'>  A)  +  #-#(oo)/. 

k  JO 


All  the  elements  denned  by  putting  ck(t)  s  0  for  all  k  except  one  particular 
value  of  k,  say  r,  define  a  closed  linear  manifold,  which  we  shall  call  9!ftr. 
%Rr  must  be  a  Hilbert  space,  since  every  element  ck(t)  in  £2(0,  oo)  defines  a 
different  element  of  ^Rr,  in  such  a  way  that  the  relation  is  a  homomorphism 
between  %Rr  and  L2(0,  oo).  It  is  plain  from  (  1  9)  that,  if  r  4=  s,  3ftr  is  orthogonal 
to  W18.  The  formulae  (25)  and  (27)  show  that  9ftl5  .  .  .,  ^0lr,  .  .  .  and  9t(oo)  reduce  H. 
Together  they  determine  the  whole  of  the  Hilbert  space  §. 

H  is  not  reducible  inside  any  one  of  the  $Rr,  for  if  it  were  there  would 
exist  a  closed  subset  9£  C  9Dftr  such  that  if  <j>  is  in  ®  n  92  (the  intersection  of 
•3D  and  9£),  H<f>  would  lie  in  $1.  It  is  obvious  that  the  solutions  of  (1  A)  for 
any  ^  in  ®  n  -ft  would  lie  in  9ft.  For  all  t  >  0,  U(t)  would  therefore  transform 
9?  into  a  subspace  of  itself.  If,  therefore,  /  is  any  element  of  <D*  n  9^,  the 
integral 

3= 


would  give  only  elements  of  92.  This  is  in  contradiction  with  the  fact  that 
since 

f=j\(t)d(U;r'9t)     (cr(0)*0), 

we  must  have 

||  [I  -  E(W]f-  [I  -  E(*t)]  <v(0)/r  ||  2  =  o(A«), 

and  therefore  the  integral  Qf  must  give  all  elements  of  9Jlr.t 

C  fiange  of  elements  fk 

We  shall  now  investigate  what  changes  are  brought  about  in  the  expres- 
sions considered  above  if  the  set  of  elements  of  £)*,  {/&},  is  changed. 

If  we  consider  the  definition  of  the  integral  in  (U;fk't  t),  it  is  obvious 
that  if  we  take  any  /  of  2)*  in  the  sums  S,  the  sum  $  will  tend  to  a  limit 


where 


r 

l^Q  J  0 


t  This  result  is  equivalent  to  the  result  that  i(d/dx)  is  not  reducible  in  L*(Q,  oo), 
and  could  be  either  deduced  from  that  result  or  regarded  as  a  new  proof  of  it. 
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It  is  therefore  obvious,  from  (17),  that  if  /  is  in  $),  the  integral  has  the 
value  zero  for  every  p(t).  If  we  substitute  for  /  the  sum  of  two  elements  g 
and  h  of  $)*,  the  resulting  integral  is  the  sum  of  the  integrals  resulting  from 
g  and  h  alone.  In  particular,  addition  of  an  element  of  ®  to  an/  makes  no 
difference  to  the  resulting  integral. 

Since  the/fc  are  themselves  elements  of  Q*  they  are  given  by  formulae 
of  the  form 


where  the  {cki(t)}  for  each  k  are  of  integrable  square  and  differentiable  in 
mean  square,  and  are  given  by  (21),  from  which  we  derive  cw(0)  =  8M, 
where  dkl  is  the  Kronecker  delta.  We  can  change  from  the  set  {fk}  to  a  new 
set  {/^}  given  by 

/i  =  2        c'M(t)d(V',l',t)  +  E(ao)f'k. 

i  Jo 

We  require  that  the  new  set  be  such  that  all  elements  of  1)*  can  be  expressed 
(mod®)  linearly  in  terms  of  it  —  and  for  this  it  is  sufficient  that  the/^  can 
be  expressed  in  terms  of  the  new  set.  It  is  necessary  and  sufficient  for  this 
that  the  matrix  cw(0)  have  an  inverse.  In  addition,  the  set  {fk}  should  satisfy 
(16),  so  that  _ 

0^(0)==  *„,*., 


that  is,  the  matrix  ||  ckl(0)  \\  must  be  unitary. 

In  the  simplest  case,  the  cki(t)  satisfy  cw(0)  =  d^.  In  that  case  it  follows 

from  (17)  that  -=-  1|  E(t)  (fk  -fk)  \\  2  =  0  for  t  =  0,  and  hence  that  fk  -fk  is  in  ^  : 

the  change  of  ffc  in  this  case  does  not  alter  any  of  the  integrals. 
In  the  general  case,  we  have 


where  g  is  an  element  of  5)  .  Iff  is  any  element  of  §  given  by 
/=  S  rck(t)d(Uifk-t  t)  =  s  P°c; 

k  J  0  Z    JO 

we  must  therefore  have         ck(t)  —  ^  c't(t)  c'lk(Q). 

The  manifolds  ^fflk  defined  by  the  new  set  {fk}  are  not  the  same  as  the 
manifolds  •Jft*..  Taken  together,  the  9ftJ.  determine  the  same  subspace  — 
$©9ft(oo)  —  of  §  as  do  the  50^fc;  but,  in  the  general  case,  no  9%  need  have 
any  element  except  the  zero  element  in  common  with  any  9Dftj.  The  manifolds 
$Rk  are  not  uniquely  determined  by  H,  unless  it  has  deficiency  index  1  . 
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The  formulae  will  take  the  simplest  form  if  thefk  are  chosen  to  lie  in  the 
yftk  which  correspond  to  them;  in  particular,  they  could  be  chosen  to  be 
characteristic  vectors  of  the  transformation  H *,  as  will  now  be  shown. 

For  let  fk  =  f *e~*d(U\  k\  t) ; 

Jo 

then  H*K  =  -i  fV'd(*7;  k\  t). 

Jo 

Thus  H*fk  +  ifk  =  0.  Obviously  any  element  in  <$lk  which  satisfies  this 
condition  is  a  multiple  of  ^flk.  There  are  therefore  n  mutually  orthogonal 
elements  satisfying  //*/+  if  =  0,  one  in  each  of  the  manifolds  Wlk. 

The  main  results  reached  above  are  summed  up  in  the  following  theorem : 

THEOREM  V.  If  H  is  a  maximal  symmetric  operator,  of  deficiency  index 
(0,  ri)>  where  n  may  be  finite  or  countably  infinite,  then  H  is  reduced  by  (n-\- 1) 
manifolds,  Wlr  (r  —  1,2,  ...,n)  and  SR(oo)  which  together  determine  Jp.  SR(oo) 
may  be  empty,  an  Euclidean  space  of  finite  dimension  number,  or  a  Hilbert 
space.  Inside  9R(oo),  H  is  self-adjoint.  9?(oo)  is  determined  uniquely  by  H. 

The  manifolds  *>fftk  are  Hilbert  spaces,  and  inside  each  one  of  them  H  is  an 
irreducible  maximal  symmetric  operator,  of  deficiency  index  (0, 1).  Elements 
fr  exist  for  r  =  1,2,...,%  such  that  every  f  of  §  can  be  put  in  the  form  (25) 
where  the  ck(t)  are  arbitrary  functions  of  L2(0,  oo),  the  set  {ck(t)}  is  ofintegrable 
square,  and  »& 

U/|«-|l*<°°)/ll'+2      K( 

k  J  0 

Each  fr  corresponds  to  one  9ftr,  the  elements  of  which  can  be  put  in  the  form 

rcr(t)d(U;r',t).  (29) 

Jo 

The  manifold  determined  by  all  the  %ftk  together  is  uniquely  determined  by 
H,  but  if  n  >  1  the  individual  ^lk  can  be  chosen  in  infinitely  many  ways. 

The  necessary  and  sufficient  conditions  that  f  be  in  <£)*  are  that  the  set  {ck(t)} 
be  differentiate  in  mean  square  and  E(co)f  be  in  <3),  when  H*f  is  given  by 
(21  a).  The  additional  necessary  and  sufficient  condition  for  f  to  be  in  $)  is  that 
ck(0)  =  0/or  all  k,  when  (27)  gives  Hf. 

Irreducible  maximal  symmetric  operators 

The  results  above  show  that  an  irreducible  maximal  symmetric  operator 
R  is  generated  by  a  set  of  isometric  operators  U(t)  of  multiplicity  1,  such 
that  the  corresponding  E(t)  tend  to  zero  as  t  tends  to  infinity.  Any  /  in  § 
is  expressible  by  roo  roo 

/=       c(t)d(U;fct)=\    c(t)d(U;t),  (30) 

J  n  *  .In 
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where  <p  is  an  element  of  (£~,  arbitrary  save  for  a  multiplying  constant,  and 
c(t)  is  L2(Q,  co).  Further,  /is  in  $*  if  c(t)  is  differentiate  in  mean  square,  and 
in  <£)  if  in  addition  c(0)  =  0.  Then 

V(0<Z(C7;0;0.  (31) 

These  formulae  apply  to  operators  with  deficiency  index  (0,1).  Those 
with  deficiency  index  (1,0)  are  given  by  the  same  formulae,  save  for  a 
change  of  sign  on  the  right. 

The  inverse  of  R  exists  for  a  given/  if  the  function 


f 

Jo 


is  Z2(0,oo),  and  in  that  case 

R-*f=  -i^\(t)d(U;  </>;  t).  (32) 

This  characterization  of  the  irreducible  maximal  symmetric  operator 
may  be  compared  with  that  of  von  Neumann.  f  von  Neumann  defines  the 
operator  R  by  means  of  the  isometric  transformation 


Given  a  complete  normal  orthogonal  set  of  elements  $n,  V  is  defined  to  be 
the  operator  which  transforms  0ft  into  <f>n+v  If  V  satisfies  this  condition,  H 
is  the  irreducible  maximal  operator  R. 

In  the  notation  of  this  paper,  the  set  <j>n  can  be  expressed  as  follows,  fa  is 

the  element  satisfying  Ssh  ,       .  , 

R^fa  +  ifa  =  0, 

/«00 

and  so,  since  ||  <f>n  \\  =  1,        fa  =  2       e~ld(U\  t)  ; 

Jo 

and  in  general  it  is  then  easy  to  see  that 


where  Ln(t)  is  the  nth  Laguerre  orthogonal  function. 

The  conjugate  operator 

The  results  given  above  amount  to  an  explicit  formulation  of  the  homo- 
morphism  of  R  and  the  operator  i(d/dx)  in  £2(0,oo).  This  suggests  the 
existence  of  an  operator  $  conjugate  to  R  in  the  sense  of  quantum  mechanics  : 
thati8,Sati8fying  RS-M  =  iI,  (33) 

t  See  ref.  (1),  or  ref.  (2),  Chap.  ix. 
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just  as  the  particular  realizations  R  —  i(d/dx)  and  S  =  x  satisfy 

•A    _•   A_  • 
dx  dx 

The  definition  of  S  is  obviously  as  follows,  If 

c(t)d(U\t), 

i 

then/  is  in  the  domain  of  S  if  and  only  if  tc(t)  is  £2(0,  oo)  and  then 

*te(t)d(U\t).  (34) 


Jo 


It  is  easy  to  see  that  (33)  is  satisfied. 

An  operator  conjugate  to  any  maximal  operator  can  be  defined  in  a 
similar  way  provided  J57(oo)  =  0.  If  E(co)  4=  0,  we  can  get  an  operator 
which  satisfies 

RS-SR  =  i(I- 


S  is  clearly  a  self-adjoint  operator,  with  a  spectrum  covering  the  entire 
range  from  0  to  oo. 

From  (18)  we  can  write  the  expressions  for/,  Sf  in  the  form 


/  =  -  f  "W)/,     Sf  =  -  f  %  dE(t)f. 

Jo  Jo 


(35) 

E(t)  are  the  projection  operators  corresponding  to  8,  in  the  appropriate 
resolution  of  the  identity. 

The  solutions  of  equation  (1) 

For  any  (j>  in  ®,  the  equations  (1)  have  an  infinite  number  of  solutions 
for  t  >  0.  The  solutions  constructed  in  the  proof  of  theorem  I  for 


0  =  S  f  °°c*(A)d(*7;  k; 

k  J  0 

can,  in  our  present  notation,  be  written  for  t  >  0 

=  S  rCk(X)d(U't  k\  A  +  0+  U(t)E(oo)<f>, 

k  J  0 


and  for  £  <  0 

°°;  k;  \  +  t)  +  [U(-t)]-i  E  (<*)</>.        (36) 


For  t  <  0  there  is  no  other  solution  of  (1  ),  but  for  t  >  0  other  solutions  can  be 
obtained  by  addition  of  elements  of  §091(0  to  U(t)^>.   In  fact,  if  dk(\)  is 
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any  set  of  functions  forming  a  set  of  integrable  square  and  differentiable  in 
mean  square  over  any  finite  interval  and  with  dk(0)  —  0  for  all  k,  then  the 
elements 


are  solutions  of  (1)  with  ^(0)  =  0. 

If  $  is  given  by  the  above  formula,  with  ck(0)  4=  0,  so  that  <j>  is  in  <£)*,  then 
U(t)(f>  is  not  in  3)*,  and  so  is  not  a  solution  of  (1).  But  if  we  add  to  U(t)<j> 
an  element  i/f^t)  denned  as  above,  except  that  we  must  have  dk(0)  =  c&(0), 
instead  of  ^(0)  =  0,  then  U(t)(f>+  ijr^t)  is  a  solution  of  (1)  with  ^(0)  =  <j6. 
Hence  the  enunciation  of  theorem  I  is  also  true  for  elements  in  $)*. 

Analogue  of  the  spectral  resolution  of  self  -ad  joint  operators 

It  is  impossible  to  give  a  representation  of  an  operator  which  is  maximal 
but  not  self-adjoint  similar  to  that  which  exists  for  self-  ad  joint  operators. 
However,  a  representation,  analogous  to  the  spectral  resolution  of  self- 
adjoint  operators  in  one  respect,  can  be  found:  it  is  possible  to  give  a  repre- 
sentation in  which,  when/is  represented  by  a  given  integral,  Hfis  represented 
by  a  similar  integral  with  the  integrand  multiplied  by  the  independent 
variable.  It  will  be  sufficient  to  prove  this  for  the  irreducible  operator  R. 
In  (30),  c(t)  is  £2(0,oo),  so  that  if 


1      fA 
then  c(t)  =  l.i.m  .  -rr^-r        C(u)  e~iut  du, 

V(27T)J_A 

by  the  well-known  theorem  on  Fourier  transforms  in  L2(0,  oo),|  and  if 

rt 

c(t)  =       c'(t)dt,  where  c'(t)  is  L2(0,oo), 
Jo 

1      fA 

c'(0  =  l.i.m.  -r-r-T         (-iu)  C(u)e~iutdu 
V(27r)J-A 

almost  everywhere.  Thus  (31)  gives 


See  ref.  (5),  theorem  48. 
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It  is  easy  to  show  that  the  order  of  integration  can  be  inverted,  using  the 
facts  that  c(t)  and  c'(t)  are  L2(0,oo),  while  C(u)  and  uC(u)  are  £2(- 00,00). 

Then  i  /»«>  /»a» 

/=  lim          C(u)du\    e~iutd(U',t), 

V^        /  T— >-oo  J  — oo  J  0 

where  the  integral  over  (  —  00,00)  means  the  limit  of  the  integral  over 
( —  A,  A).  The  inner  integral  is  equal  to 

d  rre--'-id(t7;0_ 


—it 
From  (22),  we  have  for  all  T 


T 


-d(U-t) 


2 


-it 


dt 


r 
I 

Jo 


The  total  variation  over  an  interval  (u,  v)  of 

-  T  e~iut  _  l 

J  o          it 

is  therefore  less  than  -jTr  |  u  —  v  \ .  This  function  therefore  tends  to  a  function 
of  bounded  variation  as  T  tends  to  oo,  and  we  can  show,  as  for  functions  of 
a  real  variable  (to  which  case  this  is  indeed  reduced  by  considering  (f,g) 
for  an  arbitrary  g),  that  we  can  write 

/=-/7^r    C(u)d(W;u),  (37) 


/•oo  p~iut  _  l 

where  (W;u)=\    -  -  r-^d(U;t). 

Jo      —w 

If  Hf  exists,  it  corresponds  to  the  function  ic'(t)  which  is  then  £2(0,oo), 
satisfies  c(0)  =  0,  and  has  Fourier  transform  uC(u).  Thus  if  /is  given  by  (37) 


Clearly,  for  <2),          C(u)  du  =  0.  Iff  is  in  <£>*,  and  c(0)  =|=  0,  the  Fourier  trans- 

J  —  CO 

form  of  ic'(t)  is  ic(Q)  -f  uC(u). 

A  formula  for  (H  —  z)~*  can  be  derived  from  this,  or  verified  independently. 
If  3(z)  <  0,  we  get  for  any  /  in  ^> 


The  expression  for  this  directly  in  terms  of  c(t)  is 

i  t) 
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The  representations  (37)  and  (38)  cannot  be  spectral  resolutions,  for  only 
a  self-adjoint  operator  can  have  a  spectral  resolution.  We  shall  discuss  this 
in  more  detail  shortly. 

The  C(u)  are  restricted  to  belong  to  the  class  of  functions  of  L2(  —  oo,  oo) 
which  are  transforms  of  functions  of  L2(0,  oo);  this  set  of  C(n)  gives  the 
Hilbert  space  Jp  by  the  representation  (37).  If  we  consider  the  set  of  all 
C(u)  in  Lz(  —  ao,ao),  and  for  each  C(u)  define  an  element/  by  the  formula 
(37),  the  set  of  elements  so  defined  constitutes  a  Hilbert  space  ^  if  we  define 
the  scalar  product  by 


This  space  ^  contains  the  space  §:  for  if  C^u),  C2(n)  are  transforms  of 
functions  of  Z/2(0,  oo),  then 

(A>/2)= 

JO 

by  Parseval's  theorem.  In  §1  the  formula  (38)  defines  a  transformation 
H!  which  is  equal  to  H  for  elements  in  $£)  but  is  defined  in  a  set  ^  everywhere 
dense  in  $.,_:  the  set  with  uC(u)  in  L2(  —  oo,  oo).  This  operator  HL  is  obviously 
self-adjoint  in  $$lf 

It  is  obvious  that  the  formulae  (37)  and  (38)  give  the  spectral  resolution 
formula  for  H±  in  §IP 

That  (37)  and  (38)  are  not  spectral  resolution  formulae  in  §  can  be  seen 
as  follows.  If  c(t)  is  any  element  of  L2(Q,  oo),  C(u)  its  transform,  then  if 
C^(u)  is  the  function  equal  to  C(u)  in  an  interval  (A,/^),  the  formula  (37) 
applied  to  CXfl(u)  defines  an  element  of  §13  not  in  <p;  for  the  transform  of 
<7A/i(w)  must  be  an  analytic  function  of  t,  since  Cx/l(u)  vanishes  outside  a 
finite  interval,  and  can  therefore  not  vanish  for  all  t  <  0.  If  the  transform  is 
cA/c(J)  (a  function  of  L2(  —  oo,  oo)),  we  have 


f"  c^(t)^(t)dt=  r 

J   —  00  J    — 


for  any  two  intervals  (A,//,),  (A',X).  This  shows  that  the  components  corre- 
sponding to  non-overlapping  intervals  are  orthogonal  in  Jpjj  but  the  pro- 
jections on  to  §  of  elements  generated  by  non-overlapping  intervals  are 
not  necessarily  orthogonal.  The  scalar  product  of  these  projections  is  in  fact 


which  need  not  be  zero  for  non-overlapping  intervals. 
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These  results  can  be  extended  to  general  maximal  operators:  this  merely 
involves  replacing  the  integrals  (37)  and  (38)  by  sums  of  integrals  over  the 
%Rk  and  adding  projections  on  SR(oo),  E(ao)f  and  HE(co)f  to  them. 
We  thus  get 

THEOREM  VI.  //  H  is  a  maximal  operator  in  a  Hilbert  space  §,  then  § 
can  be  imbedded  in  a  Hilbert  space  in  which  there  exists  a  self-adjoint  operator 
equal  to  H  for  elements  lying  in  the  domain  ®  of  H. 

The  integrals  in  (37)  and  (38)  are  analogous  to  ordinary  Stieltjes  integrals, 
and  to  the  integrals  in  the  ordinary  spectral  resolution  theory,  in  that 
( W'}  u)  is  an  element  for  every  u  and  has  bounded  variation  over  every  finite 
interval.  They  are  unlike  the  integrals  in  (U-,  k\  t)  in  this,  for  these  are  not 
integrals  in  terms  of  the  variation  of  elements  of  §.  (37)  and  (38)  do  express 
H  in  terms  of  projection  operators:  for  the  operator  which  transforms/ 
given  by  (37)  into  the  projection  on  to  $  of  the  element  of  ^ 

C(u)d(W\u) 


is  a  projection  operator:  but  the  manifolds  corresponding  to  non-overlapping 
intervals  are  not  orthogonal  (in  §). 

6.   Applications 

Theorems  I  and  II  have  an  application  to  the  method  adopted  by  Professor 
Titchmarsh  for  the  proof  of  the  expressions  of  functions  in  terms  of  eigen- 
functions  of  Sturm -Liouville  differential  equations.  In  his  paperf  Professor 
Titchmarsh  starts  with  the  Schrodinger  equation  corresponding  to  the 
Sturm-Liouville  operator,  but  uses  this  equation  only  in  a  heuristic  fashion: 
for  the  deduction  of  the  expansions  he  does  not  actually  make  use  of  any 
assumption  as  to  the  existence  of  a  solution  of  the  Schrodinger  equation, 
but  works  with  its  Fourier  transform  in  the  complex  plane — or,  what  comes 
to  the  same  thing,  with  the  resolvent  operator  of  the  Sturm-Liouville 
operator.  The  method  of  this  paper  allows  of  a  rigorous  use  of  the  Schrodinger 
equation:  in  fact,  it  gives  immediately  the  result  that  if  H  has  the  discrete 
spectrum  {Aw}  with  characteristic  functions  <fin,  then  if  (^,  (f>n)  —  cn  the  series 
^cneiXni(j)n  tends  to  <f>  as  t  tends  to  0,  if  <f>  is  a  continuous  function. 

Theorems  I  and  II  can  also  be  used  to  prove  the  existence  of  solutions  of 
differential  equations  which  are  not  of  Schrodinger  type.  Consider,  for 
example,  the  Hilbert  space  consisting  of  elements  {/i,/2>/3>/4}  =/»  where 
A»  /2>  /a»  /*  are  functions  of  x,  y,  z  and  of  integrable  square  over  all  values  of 

t  See  ref.  (11). 
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(xtytz).  Sum  of  two  elements,  multiplication  by  a  complex  number,  and 
inner  product  are  defined  by  taking  the  Hilbert  space  of  the  /  as  the  direct 
sum  of  that  of  A,  A,  /3,  /4;  e.g., 

({A.  A>  A» A}»  &!>  g*  g*  04})  =  (A,  gj  +  (A, 

In  this  space,  the  operator 


0          0 


0       °     ^JT 
i  a 


0 


0      °     ^:r 

l  OZ 


is  easily  seen  to  be  self-adjoint.  The  equation 


with  f  (0)  =  {/,  ,  /„  /„  /,},     f(t)  =  {^(*),  f  ,(*),  f  ,(«),  M)}, 

has  therefore  a  solution  for  any  set  of  {/15  /2,  /3,  /4}  =  /  whose  differential 

coefficients  are  of  integrable  square,  and  the  solution  is  unique.  The  equations 


hence 


3ft 


3V. 


dy       dz         dt 


so  that  v^4  satisfies  the  wave  equation.  On  choosing  /4  = 
such  that  2*      £j>     g /. 

>\ . .  i  >\. .  ~r  ~^r~        !Yj 


and  A,  A»  A 


the  existence  of  a  solution  of  a  wave  equation  with  initial  conditions 
^4  =  0,  8^4/3£  =  ^  at «  =  0  is  demonstrated.  The  restrictions  that  (f>  and  x 
and  their  differential  coefficients  must  be  of  integrable  square  are  easily 
removed. 

Some  examples  of  the  representations  (30),  (31)  and  (32)  may  be  given. 
If  H  is  the  operator  i(d/dx)  in  L2(0,  oo),  defined  for  functions  of  the  form 


:  r 
Jo 
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with  g(r)  in  L2(0y  oo),  the  results  are  obvious.  Here  for  t  negative 

V(t)f(x)=f(x-t), 

for  t  positive  U(t)f(x)  =  f(x-t),  where  f(x)  =  0  for  #<0.  E(t)f(x)  is  the 
function  equal  to  zero  for  Q^x^t,  and  to  f(x)  for  x^t.  If  we  choose  f(x) 
with/(0)  =  1,  roo 

c(t)d(U\f-t)  =  c(x). 
Jo 

Formulae  (37)  and  (38)  reduce  to  the  Fourier  integral  formulae. 

A  realization  of  R  given  by  von  Neumann  (ref  .  (I),  Anhang  n)  is  as  follows. 
The  functions  of  the  complex  variable  z  regular  for  |  z  \  ^  1  and  with  the 
integral  (  f(g)  |a 

~- 


bounded  for  any  circle  inside  the  unit  circle,  form  a  Hilbert  space.  R  is 
realized  by  taking  ^  , 

V  -  ?       K  -  11+2! 

V     —  Z,        J-l   —  -.-  -  -  . 
*  1—  Z 

U(t)  is  then  the  operator  which  multiplies  a  function  byexpi—  £-  -  1. 
The  resolution  of  a  function  (30)  becomes 

F(z)  =  l- 

^         TT 

This  formula  can  be  deduced  from  the  Fourier  inversion  formula  for  func- 
tions in  L2(Q,  oo)  by  putting  w  =  i(z  +  l)/(z—  1). 

Some  interesting  realizations  of  R  can  be  got  in  JD2(0,oo).  Starting  with 
Fourier's  cosine  formula  as  a  particular  case  of  (30),  we  proceed  to  find  the 
corresponding  R.  We  have 

f(x)=    /(-)       F(u)cosuxdu, 

where  F(u)  =    /(-)       f(x)cosuxdx. 

Then  (eqn.  (35)), 

U(t)f(x)  =  J(-\  ^F(u-t)cosuxdu, 

and  so  from  (31)  R  exists  if  F(Q)  =  0,  and  F'(u)  is  L2(Q,  oo),  and  then 


Joo 
F'(u)cosuxdu 
0 


(39) 

D2 
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under  suitable  conditions.  Similarly,  from  (32), 

**«»)-      "*        -  + 


This  formula,  with  (39),  leads  to  inversion  formulae  equivalent  to  those  for 
the  Hilbert  transform  :  the  theory  of  the  Hilbert  transform  in  L2(  —  oo,  oo) 
can  be  deduced  from  the  present  theory. 

The  same  method  can  be  applied  more  generally  on  the  basis  of  the  theory 
of  Watson  transforms  and  general  transforms  (see  ref.  (5),  chap.  vni).  Let 


/(*)=  rh(xy)g(y)dy,  g(x)  =  f°°A;(^)/(i/)%,  (40) 

Jo  Jo 

where       k(x)  =  ~  fa+X°°t(s):r-8ds,  h(x)  =  -L  fa+i°°  $(s)x-*ds,          (41) 
2mja-iao  2mja_ioo 


with  $(s)®(l-s)  =  I. 

If,  now,  0(0)  =  0  we  can  put 


- 

J  o 

=  ilim  fV*A(3»)  (|-  f  " 

*-»oJo  \^2/Jo 

=  »  f  "^(g/xj/tgjdg,  (42) 

Jo 

where  ^(fi)  =  ^j  Jft(«)  $(-«)(-«)  g-8  «to. 

The  analysis  is  easily  justified  for  a  suitably  restricted  class  of  functions 
/,  $>>  ^,  and  the  restrictions  can  be  removed  by  considerations  of  continuity 
of  the  operators.  Then 


(*)  =  -i  rh(xy)dy  \ydr  f"fc( 

Jo  Jo     Jo 

leading  to  x*f2(x)  =  -  *  f  °/(a;)  *a(£/«)  dg, 

Jo 


where 
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Since  R~1E  —  /,  we  get  the  inversion  formula 


T 

Jo 

•r 

Jo 


where 


Since  $(s)  £)(!  —5)  =  1,  it  is  easy  to  see  that  the  new  kernels 

$!(*)  -  ft(a)  $(!-*)(-*),     &(*)  -  5B(2  -*) 
satisfy  ^(a)  $^(1  -a)  =  1. 


These  inversion  formulae  may  be  considered  generalizations  of  the 
Hilbert  transforms,  related  to  the  general  transformations  in  the  same 
manner  as  Hilbert  transform  to  cosine  transform. 

Further  interesting  results  are  obtainable  from  the  formulae  (33)  and 
(34)  for  the  conjugate  operator  $. 

If  the  realization  of  R  is  as  in  (39),  8  is  given  by 

—  2  C°° 

8f(x)  =  s(x)  =  -       ucosuxF(u)du, 


rx  2  f00  f 

so  that  I   s(T)dr  —  —\    sinuxdu  \ 

JO  TTjo  J 


The  last  expression  is  the  Hilbert  transform  of  /.  If  we  write  Tf  for  the 
Hilbert  transform  of/,  then 


and  from  (39)  Rf  =     T(xf), 

so  that  (33)  leads  to  the  formula 
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or,  since  T*  =  -  /,  Tx~Tf+xf==  -f. 

If  we  start  with  the  Watson  transforms,  as  in  (40)  and  (42),  we  get 

_       r°° 

sf  = 
Jo 

=  f 

Jo 

=  r 

Jo 

Then  f 

Jo 

l(y)  =  ^ 

8f  =  ^"2  f  "/( 
Jo 

say.  The  formula  (33)  now  gives,  with  (42), 


where 


Thus 

o 


and  so 


=  »  r^Cr-V1) 

Jo 

=ix-*  ri(ipr*) 

Jo 

=  .•  r^-i*-!)* 

Jo 


(43) 

The  analysis  leading  to  this  expression  has  been  purely  formal:  but  it  can 
be  made  rigorous  easily  enough  if  suitable  restrictions  are  laid  on  the 
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functions  involved.  In  any  case,  (43)  can  be  proved  by  observing  that  the 
Mellin  transform  of  the  first  term  on  the  left-hand  side  of  (43)  is 


while  the  Mellin  transform  of  the  second  term  is 

%(8)®i(l-  8)  2(2  -8)  =  $(8)8(1-8)  $(8-1)  (8  -1)8(2  -8)  $(8) 
-=(8-1)8(8). 

In  conclusion,  I  wish  to  express  my  sincere  thanks  to  Dr  H.  L.  Hamburger 
for  his  great  kindness  in  reading  this  paper  and  for  his  valuable  corrections 
and  suggestions. 
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THE  HYPERGEOMETRIC  IDENTITIES  OF  CAYLEY, 
ORR,  AND  BAILEY 
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1  .    Introduction 
Cayley*  in  1858  stated  that,  if 


oo         (c\ 

then         JT(o,6;c  +  i;a?)JP(c-o,c-6;c  +  i;a?)=  S  ,  \  ™  x  .4na?n.  (1) 

--- 


Using  the  notation!  A(fc)  =  T(d  +  h)/T(h)9  (2) 

we  can  write  this  identity  more  conveniently  as 


or  again,  in  a  notation  we  have  used  elsewhere  for  double  hypergeometric 
f  unctions  , 

--•  (4) 


Orr,§  discussing  the  differential  equations  satisfied  by  the  product  of  a 
pair  of  hypergeometric  functions,  gave  a  proof  of  Cayley's  identity  and 
added  the  two  analogous  results 


(5) 


*  (5),  (3),  268-269;  quoted  by  Bailey  (1),  84. 

f  This  is  at  variance  with  the  meaning  we  have  given  to  A(A)  in  (3),  250(5),  but 
confusion  is  not  likely  to  arise  here. 
J  See,  for  instance  (4),  §  1. 
§   (10),  1-15  [10(48'"),  9(42')];  quoted  by  Bailey,  (1),  85. 
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(6) 
With  these  we  may  associate  the  identity  given  by  Bailey,* 


A(2c-a-6  +  4)     F    a,b;       "I         -a  +  ic-6  +  J; 


"I 
J 


which  is  identical  with  (5)  when  c  —  a  +  6  +  J. 

In  this  note  the  differential  equations  satisfied  by  the  subjects  of  these 
identities  are  again  considered  though  with  a  technique  which  seems 
simpler  than  that  employed  by  Orr.  We  obtain  some  additional  identities 
of  similar  character  and  then  show  that  these  identities  are  closely  linked 
with  certain  expansions  of  hypergeometric  functions  that  may  be  thought 
of,  loosely,  as  "duplication  formulae". 

2.    The  product  (l-x)~d  F(a,b\  c;  x) 

The  product  y  =  (1  -x)~d  F(a,  b;  c;  x)  (8) 

satisfies  the  differential  equation 


--=  0. 

This  is  (7),  (21)  or  (49),  but  direct  proof  is  simple,  since  (1—  x)dy  satisfies 
the  ordinary  hypergeometric  differential  equation.  The  equation  is  con- 
veniently "factorized"  as 

[d  +  c-l~x(S+a  +  d-l)][S-x(8+b  +  d)]y  =  (a-c)bxy.  (9) 

Now,  in  the  general  equation  of  rank  2 


the  substitution  T(d  +  c)y  =  T(d  +  a)z  (10) 

"exchanges  "  the  end-factors  $+c~  l,#-fato  give  the  differential  equation 

[(£  +  a  _  i  )  /(#)  _  x  g(§)  +  x*(8+  c)  h(d)]  z  =  0. 
This  is  readily  seen,  for 


*   (2),  378,  theorem  I;  quoted  in  (1),  86. 
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From  the  symmetry  on  the  right  and  (10),  it  is  clear  that  interchange  of 
c,  a  interchanges  y,  z.  This  is  essentially  the  transformation  referred  to  in 
(7),  59  as  "augmenting  (varying)  the  coefficients".  Thus,  if  in  (9)  we  write 


we  get  the  equation 

[d+b  +  d-l-x(3  +  a  +  d-l)][d-x($+c)]z  =  (a-c)bxz.          (11) 
This  differs  (formally)  from  (9)  only  in  the  change  of  parameters 
a,  b,  c,  d-*d,  c  —  a,  b  +  d>  a. 

~  (12) 

We  replace  the  F-operators  by  A-operators,  since  we  have  now  to  secure  the 
identity  of  the  functions,  not  merely  of  the  differential  equations.  We  get 
the  more  symmetrical  form  of  (12) 


Hence     A(c)  (l-^-^P^  x]  =  &(b  +  d)(l-x)-*F[d'*~£l  afl. 


,      (13) 

if  we  make  the  change  a,d->a  +  c,d  —  b  and  use  Euler  Js  identity  (25)  below.  * 

Remembering  that,  if  the  values  of  the  parameters  are  suitably  restricted, 

we  have 


we  can  rewrite  (13)  as 

C)    ' 


fluc-i(l-u) 

.     (14) 


xu 


Replacing  the  hypergeometric  functions  by  their  Eulerian  integrals  we 
get  the  identity  of  double  integrals 


n 
o 


=  (\-x)b~«  r  f  lU*-*-*(I  -  U)d~c-1  Vb~l(l  - 
JoJo 

This  is  established  directly  by  the  substitution 


(1— 


*  Direct  identification  of  coefficients  in  (13)  gives  an  identity  between  aF9's  of 
it  argument,  which,  in  Whipple's  notation,  is  ^(0;  4,5)  =  Fp(l;  4,  5);  cf.  Bailey 


uni 

(1),  18,  Table  HA. 
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We  can  give  an  independent  proof  of  (14)  by  expanding  the  hypergeometric 
function  on  the  left  so  as  to  express  the  left-hand  side  (formally)  as 


f/H(6)na?n  |*1ttc+w-1(l  -u)*-*-1  (l-xu)b~ddu 
l(c)n          Jo 


/ 
nl(c)n 


^ 

)n  \      d  +  n        ] 


by  Euler's  identity.  Finally,  we  have  by  Vandermonde's  theorem,  as  in 
(6),  (7),  that 


|  (a  +  cU6)       ^/ft  +  n.J-c;    \  ^  j./M  +  d;    \ 
=0     n!(d)n  \^  +  n         /          \      d         } 


3.    T^e  Saalschutzian  product  (l—x)~f3F2 

The  product  y  =  (1  -  x)-/B^ra'  6'  C;  x\  (16) 

can  be  expanded  as 


If  a-f6  +  c+/=^  +  e,  (17) 

the  4^  is  Saalschutzian  and  we  then  extend  the  term  to  the  product  (16) 
itself.  Under  this  Saalschutzian  condition  (17)  the  differential  equation 
satisfied  by  (16)  is  of  rank  2  only.  By  (7),  68,  (59)  this  equation,  after 
"factorization",  is 


=  -(d-a)x[(d-a+l)8+bc  +  df]y.         (18) 
The  substitution  T(8+d)  y  =  r(*+  b  +/)  z 

"exchanges"  the  factors  3+d--lt  d  +  b+fto  give 


=  —  (d  —  a)x[(d-a+ 
In  form  this  can  be  got  from  (18)  by  the  change  of  parameters 

a,  6,  c,  d,  e,/->e  —  c,  6,  e  —  a,  b+f,  e,  d  —  b. 
Hence  we  have 
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This  can  be  written  more  symmetrically  with  changed  parameters  as 

,     (20) 


where  the  Saalschutzian  condition  is  now  a  +/  =  c  +  d. 

As  in  §2  above  we  can  express  this  as  an  identity  connecting  definite 
integrals  of  which  independent  proofs  can  similarly  be  given.  The  identities 
(12),  (19)  are  possibly  new. 


4.    The  Saalschutzian  product  ZF±  x 
We  can  expand  the  product 


When  «    a  +  a'H-6  +  6'  =  c  +  c'  =  2A;  (say),  (22) 

the  4.F3  is  Saalschutzian,  a  term  that  again  we  extend  to  the  product  itself. 
With  this  condition  (22)  the  product  satisfies  a  differential  equation  of 
rank  2,  given  in  (7),  75,  (91)  as 


.     (23) 
We  interchange  the  pair  of  factors  3+c—l,  8  +  b  +  b'  by  the  substitution 


and  the  pair  of  factors  3  +  cf  —  1,  $  +  a  +  af  —  1  by  the  substitution 

T^+c')^-  r(3  +  a  +  a')z. 
Employing  these  substitutions  simultaneously,  so  that 

r(8  + 
we  get  the  equation 


4(a-c)bx(8+k-l)($+k-%)(3+k)z. 
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It  is  soon  seen  that  this  equation  is  obtained  from  (23)  by  the  change  of 
parameters 

a,  6,  c;  a',  6',  c'->6',  c  —  a,  6  +  6';  c'  —  6',  a, 


which  leaves  unaltered  the  Saalschiitzian  condition  (22).  We  thus  have  the 
identity 


=  A(a+a' 
subject,  of  course,  to  the  Saalschiitzian  condition 


The  symmetry  latent  in  this  result  can  be  brought  out  by  use  of  Euler's 
identity 

a,y-p\y\x).  (25) 


Applied,  for  instance,  to  the  right-hand  side  of  (24)  it  turns  the  F  x  F  into 

,c'-a' 
6  +  6' 


and  so  on. 

If  we  pick  out  the  coefficients  of  xn  on  the  two  sides  of  (24),  we  get  the 
identity  between  terminated  Saalschiitzian  4^3's  with  unit  argument:* 


n  '-b',a}l~b-bf^n,  -n 

4    3  '       -'  -  -  -         '       V       } 


We  can  express  (24)  as  an  identity  between  triple  integrals  in  the  form 

m^-lM  _  fia+a'-c-l  %«-!(!  -  ^)c~«-l  i;«/-1(l  -  w)c'-a'-l 
o 

x  (1  -  ta#)-&  (1  -  toa;)-6'  dtdudv 

=  f1  f  X  f  1Tb+b'-1(l  -  T)c'-b-b'-1  Uc-a~l(l  -  U)c'~a'-1  Va~l(l  -  V)a'~l 

x(l-TUx)-b'(l-TVx)b'-«'dTdUdV.     (27) 
This  can  be  verified  directly  by  the  substitution 
1-*-  - 


In  Whipple's  notation  this  is  the  identity  >S(1,  2,  3)  =  ^(2,  4,  6);  see  Bailey  (1),  §  7-3. 
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5.   Bailey*  a  identity 

If  in  the  equation  (23)  we  write  a  +  a'  =  6  +  6',  so  that  with  the  Saal- 
schiitzian  relation  (22)  we  have 

a  +  a'  =  6  +  6'  =  kt     c  +  c'  =  2k,  (28) 

we  can  remove  a  left-hand  operator  S  -f  k  —  2  and  rewrite  the  equation  (now 
of  the  fourth  order)  as 


If  now  on  the  left  we  exchange  the  last  operator  $+k  with  one  or  other 
of  the  operators  S+c—  1,  #-f  2fc  —  2,  we  can  remove  a  further  operator 
8+  k  —  1  from  the  left.  That  is  to  say,  if  we  write 


(i)  r(d  +  c)y=  r(S  +  k)z1}        (ii) 
we  get  the  respective  third-order  equations 
(i) 


(29) 
(ii)     [8  +  c-l-x(S  +  k  +  a-b-l)] 

x[d(d+2k-c-l)-x(S+2k-I)(d  +  k-a  +  b)]zz 

=  4(a  -  c)  bx(8  +  k  -  J)  z2.     (30) 

We  can  identify  the  expressions  on  the  left  of  these  equations  with  the 
expression  on  the  left  of  (  1  8)  above  if,  in  (  1  8),  we  make  the  respective  changes 
of  parameter 

(i)  (ii)  (i) 

2k-l, 
2k  —  c, 


Each  substitution  alike  gives  on  the  right  of  (18) 

-x{(b  +  c-aY-$(8+k-\)y, 
so  that  to  complete  the  identification  of  (  1  8)  with  (29)  or  (30)  we  need  further 


i.e. 
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We  therefore  have  the  identities 


A(2fc-l)      ra,6;    1      r*-a,*-6; 

2/fc-c 


when  c  is  given  one  of  the  values  a  +  b  ±  J.  These  two  identities,  we  note  in 
passing,  are  identical  when  2k  =  c  -f  1  -  If  we  equate  in  them  the  two  expres- 
sions for  J?i  x  %Flt  we  get,  not  unnaturally,  an  identity  included  in  (19). 
In  (i)  choose  the  value  c  =  a  +  b  —  J  and  make  the  change  of  parameters 


a,  6,  &->c  —  a-h  J,  c  —  6+  J, 


so  that  consequently  c-»2c  —  a  —  6  +  J.  Then  we  at  once  have  Bailey's 
identity  (7). 

If  in  (ii  )  we  choose  the  same  value  of  c  and  make  the  same  change  of 
parameters,  we  get  the  identity  (which  may  be  new) 


A(2c)  a,b; 


Choice  of  the  alternative  value  a  +  6  +  J  for  c  gives  essentially  the  same 
identities  in  changed  parameters.  It  is,  in  fact,  equivalent  to  a  transforma- 
tion of  the  products  2f\  x  2-?i  by  Euler's  identity,  for,  under  the  Saalschiitzian 
condition  a  +  a'  +  b  -h  6'  =  c  -h  c',  we  have 

c/-a/,c/-6';    1       /00. 

,     f  *-  (32) 


Identification  of  the  coefficients  of  xn  on  the  two  sides  of  (31)  gives  the 
identity  of  terminated  4^3's  of  unit  argument 


T      a-b  +  ^b-a  +  ^c,  -n      1 
4  *l2c~-a-b  +  $,a  +  b  +  %>l-c-nj 

(g)n(6)n(2c)n 
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6.    Orr's  identities 
Orr's  first  identity  (5)  comes  at  once  from  (29)  by  writing 


For  we  then  get 

[$+2c-~l-x(d+c  +  a-b~$)][d-x(8  +  c-a  +  b  +  %)]u  =  4(a  -  c)  bxu. 
But  this  is  the  equation  we  get  from  (9)  by  the  change  of  parameters 

a,  6,  e,  d-+2a,  26,  2c,  c-a-6  +  J. 
This  proves  (5). 

We  can  deduce  (6)  from  (5)  by  using  (24).  For,  in  (24),  make  the  Eulerian 
transformation  (32)  on  the  right  and  put  c'  =  c+l,  a  +  a'  =  6  +  6'  =  c-f-^. 
This  gives  us 


With  (5)  we  get 


which  is  (6)  with  a  +  J,  c  +  J  written  for  a,  c. 

We  can  put  this  analysis  in  another  way  in  deriving  (6)  directly  from 
(23).  For  put 

a  +  a'  =  c  +  •£.,     6-f6'  =  c  —  J,     c  =  c'  =  fc. 
Then  (23)  is 


Remove  the  left-hand  factor  (J  +  c-  1,  operate  with  r(£-hc—  2),  and  write 

r(^+c-t)2=r(5  +  c)y.  (34) 

We  can  then  remove  a  left-hand  operator  F(£4-  c  —  J),  leaving 


Reference  to  (9)  above  shows  that  this  is  a  differential  equation  satisfied  by 

z  =  (1  -x)a^-c-^2Fl(2a-  1,  26;  2c-  1;  x). 
This  with  (34)  gives  (6). 
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7.    Cayley's  identity 

In  Cayley's  identity  the  product  2^  x  2F±  is  no  longer  Saalschiitzian  and 
it  therefore  escapes  the  net  of  the  foregoing  analysis.  It  is,  however,  deducible 
from  the  pair  of  Orr's  identities.*  We  can,  in  fact,  go  a  little  further  and 
make  it  depend  on  either  of  Orr's  identities  by  a  suitable  operation  of  the 
type  8-\-h  that  increases  by  one  the  parameter  difference 


and  restores  the  Saalschiitzian  condition.    Operation,  for  instance,  with 
d  +  a  —  b  -f  c  will  give,  on  the  right  of  (3), 


r<J+l,&;    1  rTc  —  a,c  —  b; 

«| 

and,  on  the  left  of  (3), 


-^^-^Pa+21c'26;  J  +  (c-a- 

(36) 
Using  (6)  we  can  equate  (35)  to  the  expression 


and  a  little  manipulation  then  identifies  this  with  (36).  Operation  with  $ 
or  with  £+2c+l  similarly  contrived  will  bring  into  play  Orr's  other 
identity  (5). 

From  its  simplicity  and  symmetry  of  form  Cayley's  identity  seems  to 
deserve  a  better  status  than  that  of  a  corollary,  f  but  much  fruitless  effort 
has  driven  us  to  see  in  it  an  exceptional  theorem  dependent  on  the  precise 
values  of  the  parameters  and  accordingly  without  obvious  extensions. 

*  See,  for  instance,  Bailey  (1),  §  10-1,  86. 

f  Moreover,  in  the  duplication  formulae  that  follow,  it  is  the  series  derived  from 
Orr's  identities  that  appear  as  corollaries  to  the  series  derived  from  Cayley's  identity. 
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The  following  direct  proof  may  be  of  interest.* 
Write 


=     rc  +  i;a,&;o-«,c-6;        1 

L   c;    c  +  J;      c  +  i  J 


so  that  y  satisfies  the  differential  equations 

®y  =  0,     %  =  0, 
where      e  =  #(#  +  c--|)  (£  +  c-  l)-x(0  +  a)  (0  +  b)  (3  + 

Q>  =(})((£  +  c-  %)  (S  +  c-l)-x(<t>  +  c~a)(<f>  +  c-b)  (8  + 

Here  $  is  d  operating  on  the  first  factor  only  in  y,  <f>  is  S  operating  on  the 
second  factor  only,  and  therefore  0  +  <j>  is  8  without  restriction.  We  now 
have  to  prove  that 

y  =  (l-x)a+b~cF(2a,  26;  2c;  x). 

Subtraction  and  reduction  gives 

e-<X>  =  (£  +  c-i)(£  +  c-l)[0-0-3;(0-0  +  a  +  6-c)].  (37) 

Hence  [0  -  $  -  x(0  -  <f>  +  a  +  b  -  c)]  y  =  0, 

since  y  contains  only  positive  integral  powers  of  x.  Thus 

(l-#)2%=  [S-xtf+a  +  b-cfty.  (38) 

Write  y  =  (l-x)«+o-cz,  (39) 

so  that  we  want  z  —  F(2a,  26;  2c;  ic), 

and  remember  that  always 

[S-x(d  +  h)](l-x)hw=  (l-xY^dw.  (40) 

Then  (38)  gives  2%  =  (  1  -  x)a+b~c  Sz.  (41  ) 

Recalling  that  6x  =  ic(<9+  1)  in  0,  but  Ox  =  xO  in  $,  while  8x  =  x(d+  1) 
in  both,  we  can  write 


,     by  (37). 


*  This  (received  28  February  1945)  replaces  the  much  more  cumbrous  original 
analysis.  We  are  indebted  to  the  referee  for  suggesting  its  omission,  and  for  thus 
spurring  us  to  a  further  investigation. 
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Hence  again 


i.e.  [(*+2c--l)--a;((*  +  a  +  6  +  c)]0y  =  2abxy. 

Thus,  from  (39),  (41), 


and  so,  with  (40), 

i.e.  \d(8  +  2c-l)-x(d+2a)(S+2b)]z  =  0. 

Thus  z  =  F(2a,  26;  2c;  x)  and  the  proof  is  complete. 

8.    Some  hypergeometric  expansions 

Most  of  the  foregoing  identities  can  be  made  to  yield  expansions  of  a 
certain  type  by  use  of  the  following  lemma  : 

LEMMA.   If  F\     '  "  '  x    ,  F\    ,  '  "  "  '  x\  are  any  two  hypergeometric  functions 
[_c,  ...;    J       |_c  ,  ...;    J 

(of  any  order)  and  h,  k  any  two  suitable  constants,  then 

r»,  ...;   1  Fro',  ...;   ~1  _  *  _J^i^Ak^ki-^kii^_  ^ 
U-.;    J     LC',...;    ]~r^r\(k  +  r-l)r(k)2r(c)r..,(c')r... 


where  k(h)  =  T(8  +  h)IT(h),     etc.  (42) 

Proof.   We  have 

A(A)     Pa,  ...;-!          ni,a,...;    "I 

A(t)-*U...;  _l~    Uc,  -..;  J 

and  so,  as  usual,* 


Thus  the  series  on  the  right  of  (42)  can  be  written 


1  )r  (0  +  k)r  ($  +  k)r 


where  (as  in  §  7  above)  6,  <j>  act  respectively  on  the  first  F  alone  and  on  the 
second  F  alone. 


*  Cf.,  for  instance,  (3),  §  1,  250. 

E2 
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We  have  therefore  to  prove  that 

= 
~6    4 


But  this  is  a  corollary  of  Dougall's  theorem.*  As  quoted  by  Bailey  we 
make  the  change  of  parameters 

a,  c,  d,  e-*k  —  1,  k  —  h,  —0,  -~<j), 

and  this  completes  the  proof  of  the  lemma. 
Applying  the  lemma  to  (12)  above  in  the  form 


we  get  (after  a  little  cancelling) 


r;    ~|    _  rd  +  r,c-a  +  r;    1 

X      2^1  ^  ^       • 

c  +  2r  J       L        c  +  2r  J 

Euler's  transformation  performed  on  the  first  two  F'a  reduces  this  to 
_   »  (c-a)r(a)r(d)r(c-b-d)r 

""-^ 


which  is  exactly  our  (4),  114,  (16)  in  different  parameters. 

We  can  similarly  apply  the  lemma  to  (19)  having  first  moved  A(c?)  to 
the  right.  The  factor  (1  —  x)b~d  on  the  right  gives  rise  to  the  hypergeometric 
function  F(b  +/+  r,  d  —  b  +  r  ;  d  +  2r;  x)  which  by  Euler's  identity  is  equal  to 

(1  -x)-fF(b  +  r,d-b-f+r;  d  +  2r;  x). 

Cancelling  the  (  1  —  x)~f  on  the  two  sides  and  noting  that  d  —  b  —  /  —  a  +  c  —  e 
by  the  Saalschutzian  condition  (17),  we  have  finally 


-e)r(e-a)r(e-c) 


;    "1       p  +  r,e-a  +  r,e-c4-r;    1 
a:  Iq^o  ,  a?  I  • 

J3  2L           d+2r,e  +  r  J 


(44) 
v     ' 


*  (9),  122  (9);  or  Bailey  (1),  27,  §4-4(1). 
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Applying  similar  technique  to  Bailey's  identity  (7)  we  get 
2a,26,c; 


;      1       "  (a  +  b-  c)r  (a)r  (b)r  (c  -  a  +  |)r  (c  -  b  +  j)r   2r 
KJ     rii        r!(a  +  &  +  iUc  +  r-i)r(c  +  i)2r 

r,&H-  r;    "1   ,.  r2c-a-6-f  r  +  -J,a  +  r,6-fr;    ~] 

«  i       #       1^2  r,  1  i  1  *       • 

4-2r  +  J     J3  2L      c  +  2r  +  4,a  +  6  +  r  +  i         J 


The  analogous  identity  (31)  does  not  yield  a  result  of  interest.  We  may 
note  that,  if  in  (45)  we  make  c->oo,  then,  formally  at  any  rate, 


The  coefficients  here  can  be  verified  by  use  of  Saalschiitz's  identity  and 
the  duplication  formula  of  the  gamma  function. 

9.    Duplication  formulae 

The  method  of  the  preceding  section  applied  to  Cayley's  identity  (3) 
gives  the  expansion 

2a,2b;    -I        »(-i)r(a)r(6)r(c-a)r(c-6)        /      Ta  +  r.ft  +  r; 

~- 


(47) 

which  may  be  regarded  as  a  "duplication  formula"  of  the  hypergeometric 
function.  It  should  be  compared  for  contrast  and  similarity  with  our 
"duplication  formula"  (4),  115,  (24): 


\    ~]    ^ra  +  r,6  +  r;       1 
x\  2^1  «        —  a;    . 

J2  1L     c  +  2r  J 


Operation  on  (47)  with  1  +  #/(2c  —  1)  and  with  1  +  $/2a  gives  respectively 
M6; 


1,26;    "I        «  (-l)r(a+l)r(fc)r(c-q)r(e-fc) 
~ 
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These  are  the  series  derived  from  Orr's  identities  (5),  (6)  by  the  method  of 
§  8.*  Here  it  is  to  be  noted  that  the  formulae  derived  from  Orr's  identities 
appear  as  subsidiary  to  those  derived  from  Cayley's  identity.  The  series 
(47)  converges  when  |  x  \  <  1,  since  by  (3),  267,  (77), 


F(a  +  r,  b  +  r  ;  c  +  2r-3  x)  <  T-yr  F(a>  6;  c;  *)> 

where  we  sufficiently  regard  a,  b,  c,  x  as  positive. 

Expansions,  reciprocal  to  (47),  (48),  (49),  can  be  given  as  follows: 

ra,b-    ~»_   -  (l)r(a)r(b)r(c-a)r(c-b)  a  +  2r,  26  +  2r; 

" 


(50) 


V    V  2 /r  V^/r  V^/r  \^      **  IT  W_    _ 


(51) 


_  ,r'7*  P  , 

"  -  *    aA  *'    (52) 


Of  these  (51),  (52)  can  be  obtained  from  (50)  by  operation  (once  more)  with 
1  +  3/(2c  -  1  ),  1  4-  £/2a,  respectively. 

We  can  deduce  (50)  from  (47).  For,  if  on  the  right  of  (50)  we  replace 
F(2a  +  2r,  2b  +  2r;  2c  +  4r;  x)  by  the  series  in  (47),  taking  N  —  r  as  the 
parameter  of  summation,  we  get  after  some  rearrangement 


c  +  2^ 

1,      -JV     "1 


By  the  corollary  to  Dougall's  theorem  already  quoted,  this  5-F4  can  be 
expressed  as  a  product  of  gamma  functions  which  includes  F(l  —  N)  in  the 
denominator.  The  5F±  thus  vanishes  unless  N  =  0.  The  outer  sum  therefore 
reduces  to  its  first  term  [F(a,  6;  c;  x)}2,  and  the  proof  of  (50)  is  complete. 
Conversely  we  can  deduce  (47)  from  (50)  by  a  similar  argument. 


*  We  have  to  make  the  changes  c  ->•  c  —  £  in  (48)  and  a,  c->a+l,c  +  ^in  (49). 
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With   the   foregoing   "duplication   formulae"   we   may   associate   the 
somewhat  similar  pairs  of  reciprocal  series 


(53) 


(55) 
;  1 

^     *  J  ' 


Of  these  (53),  (54)  are  merely  our  (3),  (50),  (51)  with  y  put  equal  to  x.  We 
can  deduce  (56)  from  (47)  by  substituting  for  (2.^i)2  from  (54)  in  (47),  re- 
arranging as  in  the  proof  of  (50)  above,  and  using  a  corollary  of  Dougall's 
theorem.*  Similarly,  (55)  can  be  deduced  from  (56)  or  from  (53),  (50). 
Lastly  we  note  the  identity 


(57) 

which  reduces,  when  c  =  a  +  b  4-  \  \,  to  Gauss's  quadratic  identity.  To  prove 
(57)  we  quote  (4),  115,  (21),  (23)  in  the  forms 

"•> 


and  substitute  from  (58)  for  the  2FX  on  the  right  of  (59).  Writing  N  —  r 
for  s  and  rearranging,  we  then  have 


«   (a)N(2b)w   2N       r~2a  +  2N,2b  +  2N;    "1       Tc-a,  -N~] 

2^         -——.^ __  aj        2^1 1  ^    i    o  \r  ^   I2^l|  j,    ,    1  J 


by  Vandermonde's  theorem. 


*  Precisely  it  is  the  one  given  by  Bailey  as  (1),  28,  (3). 
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10.    A  multiplication  formula 
We  conclude  with  the  "multiplication  formula" 


=  v    _J?y6k(Z)r_    ^pM-c-r,  -rl 
r-or!(c)r(c  +  r  +  r-l)r8  1    oM-a-r    J 


By  comparing  coefficients  ofxn  on  the  two  sides  we  see  that  it  is  equivalent 
to  the  identity 


^fa,  1  —  c  —  r,  —  r~]    „  f/?,  1  —  c  —  r,  —  r"l 
Xa^i  LjPo  ,      ,  • 

L    7,1-a-r    J       L    7,1-^-r    J 


To  prove  this  pair  of  formulae  notice  first  that  operation  with  d  +  a  4-  a 
on  (60)  gives 


^f^6;    1    T,r<*+l>fc    1 
1L   c  ^J^L       7          J 


'  i 

»    • 
J 

If  we  substitute  from  (60)  for  the  two  products  2Fl  x  JF±  on  the  left  of  this, 
we  get  an  identity  so  long  as 

-.Fa,  1  —  c  —  r,  —  r~l          T^Fa+  1,  1  —  c  —  r.  —  r~] 
3jP2  H-a3jpi  i 

L    7>  —  a~  r     J  L       7,1—  a  —  r       J 


which  is  just  the  identity 

J'rfll'a2:a3 

C1>C2  J  L       CUC2 
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when  x  —  1.  Rearranging  the  argument  we  see  that,  if  (60)  is  true  for  some 
6,  c,  a,  ft,  y  and  all  a,  it  is  true  for  6,  c,  a  4- 1,  ft,  y  and  so  on.  The  operation 
3  +  a  +  a  merely  multiplies  a  term  xtl  by  n  +  a  +  a;  we  can  therefore  transfer 
the  argument  to  (61)  for  some  fixed  n.  If  we  multiply  (61)  through  by 
(a  +  ot)n,  we  get  a  (presumed)  identity  in  a  of  order  not  exceeding  n.  Thus, 
by  the  argument  above,  if  it  is  true  for  some  a  (and  alj  a),  it  is  true  by 
induction  for  a 4- 1,  a +  2,  ...,  a  +  n,  ...,  i.e.  for  more  than  n  values  of  a, 
and  so  for  all  a. 

Now,  if  in  (61)  we  put  a  =  0,  /?  =  0,  we  get 


(c)r         rt'o 

+  i,y,      -n;       _  "1 
-ic,  c+y+rc;        J' 


and  this  is  true  by  the  corollary  of  Dougall's  theorem  just  quoted.*  Thus 
(61)  is  true  for  a  —  0,  /?  =  0,  all  y,  a,  b,  c  and  any  fixed  (positive  integer)  n. 
Hence  by  our  argument  it  is  true  for  ft  =  0,  all  a,  y,  a,  b,  c  and  fixed  n.  There 
is  symmetry  between  a,  /?  and  we  can  therefore  argue  similarly  in  /?,  so  that 
now  (61)  is  proved  for  all  a,  /?,  y,  #,  b,  c  and  every  positive  integer  n.  The 
proof  of  (61),  and  consequently  of  (60),  is  then  complete. 

In  (60)  put  (a,/?,  y)  =  (a,  6,  c).  The  first  3J^2  on  the  right  is  then 


["-r,a,  1-c-rl 
^     i-a-r,c    J' 


By  Dixon's  theorem j*  this  reduces  to 

r(l-^)F(l-r-a)r(c 
~ 


(62) 

This  vanishes,  if  r  is  odd,  because  of  the  infinite  factor  F(l  —  r)  in  the 
denominator  (or  we  can  change  the  sign  of  the  3J^  by  reversing  the  order  of 
its  terms).  When  r  is  an  even  integer  2n,  we  can  rewrite  (62)  as 

(n)n(c-a)n 


Dealing  similarly  with  the  other  3F2  and  cancelling  factors  on  the  right,  we 
again  get  the  "duplication  formula"  (50). 


*  (1),  28,  (3). 

f  (8),  289;  quoted  by  Bailey  (1),  13,  (1). 
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THE  SELF-POLAR  RIEMANN  COMPLEX  FOR  A  F4 

By  H.  S.  RUSE 
[Received  29  October  1944 — Read  16  November  1944] 

This  paper  follows  three  others  (Ruse,  1944 a,  b,  1946). |  It  deals  with  the 
case  when  the  Riemann  tensor  at  a  point  P,  (x1),  of  V±  defines  in  the  associated 
projective  space  $3  a  quadratic  complex  of  lines  which  is  self-polar  with 
respect  to  the  fundamental  quadric. 

Some  of  the  results  in  the  present  paper  are  given  in  one  form  or  another 
by  Struik  (1927-8),  Lamson  (1930)  and  Churchill  (1932).  The  whole  theory 
is  contained  explicitly  or  implicitly  in  works  on  classical  projective  geo- 
metry, and  it  has,  moreover,  close  connexions  with  spinor  analysis  and  the 
theory  of  groups.  For  the  latter  facts  no  apology  is  made,  the  whole  purpose 
of  the  paper  being  to  obtain  results  in  Riemannian  geometry  by  making  as 
wide  an  appeal  as  possible  to  classical  projective  geometry  and  group  theory*. 

1.    Notation  and  preliminaries^ 

Let  T4  be  the  affine  tangent  space  at  P,  (xl),  of  J^,  and  $3  the  projective 
3-space  at  infinity  in  T±.  Then  the  equation  of  the  Riemann  complex  in  8Z  is 

JWV  =  o.  (i-i) 

Here  pij  ==  Xi  Y?  —  YiXj  are  the  Pliicker  coordinates  of  the  line  of  S3  joining 
the  points  Xi,  Yl,  and  are  the  components  in  V±  of  the  simple  bi  vector  defined 
by  the  contravariant  vectors  X1,  Y{  at  P. 
The  dual  equation  of  the  complex  is 


!  =   0, 

where  Qpti  =  \Gijklpklt  (1*2) 

P«>  (1-3) 


These  papers  will  be  referred  to  as  1,  2,  3  respectively. 

A  more  detailed  account  of  the  notation,  etc.,  is  given  in  papers  1,  3. 


76  H.  S.  RUSE  [16  Nov. 


the  e's  being  the  dualizing  tensors  of  non-zero  components   ±  ^g,   ± 
respectively.  The  normalizing  factors  J,  J  are  inserted  for  convenience. 

The  fundamental  tensor  defines  in  S3  the  fundamental  quadric  of  point- 
and  tangential-equations 


respectively.  The  Riemann  complex  is  self-polar  with  respect  to  this 
quadric  (1,  §  6)  if 


where,  as  shown  in  1,  the  scalar  AC  must  be  equal  to  ±1.  If  K  =  +  1,  then 


whence  R^  —  ^g^R,  (1*5)1 

so  that  V^  satisfies  the  Einstein  condition  at  (xl).  Here  R^  is  the  Ricci  tensor 
denned  by  Rtj  =  gmnRimnj.  If  K  =  -  1,  then 


whence  Rijkl  =  -^(gik^i  +  fJjiRik-(Jii^k-9ikRn)^  f1'7) 

and  also  R  =  0.  (1-8) 

It  is  easy  to  see  that  (1*8)  is  a  necessary  consequence  of  (1-7). 

When  the  Riemann  complex  satisfies  (1-4),  or  equivalently  (1-5),  it  is 
called  self  -polar  of  the,  first  kind]  and  when  it  satisfies  (1*6),  or  equivalently 
(1-7)  and  (1-8),  it  is  called  self  -polar  of  the  second  kind.  The  former  is  by  far 
the  more  interesting  case,  and  it  is  mainly  with  it  that  the  present  paper 
deals. 

Even  when  the  Riemann  complex  is  not  itself  self  -polar,  the  tensor  can 
be  expressed  as  a  sum  of  two  parts  which  respectively  represent  self  -polar 
complexes  of  the  first  and  second  kinds.  Thus,  if  we  put 


where  Qijkl  =  i(^A-z 

following  Churchill  (1932),  then  G^u  satisfies  (1-4)  and  Eijkl  satisfies  (1-6). 

Raising  and  lowering  suffixes  by  means  of  the  fundamental  tensor  corre- 
sponds geometrically  to  taking  the  polar  reciprocal  in  $3  with  respect  to 
the  fundamental  quadric;  raising  and  lowering  pairs  of  skew  suffixes  by 
means  of  the  e-tensors  (indicated  notationally  by  °)  corresponds  to  taking 
the  dual  coordinates  of  the  geometric  object  represented  by  the  operand. 
The  two  operations  are  commutative. 
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A  skew  tensor  (bivector)  a^  represents  in  SB  the  linear  complex  of  dual 
equations!  «^«  =  0,  «««°p«,  =  0. 

A  co  variant  vector  ri  represents  the  plane  of  equation  riXi  =  0. 

Ennuplet  coordinates  Xa  in  &,  are  defined  by  taking  at  P,  (#*),  in  V± 
an  arbitrary  orthogonal  ennuple  h%==(h%,  ...,Af),  a,b,  ...,g  being  used  for 
ennuplet  suffixes  and  i,  j,  ...  for  tensor  suffixes.  The  /&'s  satisfy  the  usual 
relations  gabh<m  =  gti,  h'ah?  =  dj,  tf«A?Aj  =  g*. 

etc.,  where  gab,  gab  are  both  equal  to  the  Kronecker  delta  8ab.  When  the  &'s 
define  real  directions  of  T^  and  the  metric  is  not  positive-definite  at  P,  some 
of  them  are  purely  imaginary;  for  example,  if  the  signature  is  (+  +  H  —  ), 
then  the  h%  are  real  for  a  —  1,  2,  3  and  imaginary  for  a  =  4. 

The  ennuplet  coordinates  of  a  point  and  of  a  plane  in  S3  are  defined  in  the 
usual  way  by  the  equations 

X°  =  h?X<,     ra  =  hir(, 

and  are  scalars  for  transformations  of  the  coordinates  (x1)  in  V±,  but  trans- 
form linearly  under  rotations  of  the  ennuple.  The  points  whose  coordinates 
in  the  JC^-system  are  h{,  .  .  .  ,  h\  are  the  vertices  of  the  tetrahedron  of  reference 
of  the  ennuplet  system  Xa.  The  non-zero  components  of  the  ennuplet 
dualisors  eabcd,  eabcd  are  ±  1  ,  and  the  ennuplet  equation  of  the  fundamental 
quadric  is  -b  ...  +  (X')*  =  0.  (1-9) 


A  linear  complex  of  ennuplet  coordinates  qab  may  be  represented  in  the 
usual  way  by  a  point  (ql,  ...,  #6)  of  a  projective  $5  by  writing 

gl    =    ^23  g2    =    ^31  ^3    =    gl2    -j 

i  V  tf  1  V  V     ,1  (1>1()) 

q*  =  qu,     q*  =  q™,     q*  =  q™,) 
or,  as  a  single  equation, 

either     qa  =  \ylbqab     or     <fb  =  'tf?q*  (1-11) 

(see  3,  §3).  Greek  suffixes  will  always  run  from  1  to  6  and  will  refer  to  $5, 
while  Latin  suffixes  will  continue  to  run  from  1  to  4  and  will  refer  to  $3  or 
to  V±.  As  it  stands,  the  transformation  amounts  simply  to  the  replacement 
of  pairs  of  skew  Latin  suffixes  by  single  Greek  suffixes  according  to  the 

scheme  «6  -  23     31     12     14     24     34) 

[.  (1-12) 

a=123456j 

|  In  discussing  an  arbitrary  bivector,  it  is  a  matter  of  taste  or  convenience  whether 
the  covariant  tensor  be  regarded  as  primary  and  the  contravariant  as  the  dual,  or 
vice  versa. 
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It  is  to  be  noticed  that,  if  y£&  an(*  7?  in  (1-11)  are  regarded  as  tensors  of 
the  types  indicated  by  their  suffixes,  then,  if  desired,  the  coordinates  of  $3 
and  $5  may  be  transformed  independently  of  one  another.  In  particular, 
(I'll)  may  be  written 


where  7&  =  ySWj,     7?  = 

When  the  transformation  has  the  simple  form  (1-10)  the  coordinates  #a 
are  called  basic,  the  orthogonal  ennuple  hla  being  the  basis. 
The  lines  (or  special  complexes)  p^  of  S3  satisfy  the  identity 

K«PV=0  (l-H) 

and  so  correspond  in  8&  to  the  points  of  the  4-quadric 

eaftp*pt  =  0, 

where  ea/?  =  efta  =  K^7?r?-  (M5) 

This  will  be  called  the  e-quadric.  In  the  basic  coordinate-system  ,f 

€*/*     (a  row,  £  column),  (1-16) 


eaft  =  rO      1 

L/  o 


O  and  /  being  the  null  and  unit  3x3  matrices  respectively.  The  ea^  are  the 
coefficients  in  the  tangential  equation  of  the  same  4-quadric,  so  eayey^  =  8%, 
the  tangential  equation  being 

e**0Pa0Pfi  =  0.  (M7) 

To  the  Riemann  complex  HijKiPiiPkl  —  0  and  to  the  special  complex  of 
lines  in  $3  which  touch  the  fundamental  quadric,  namely 

fltoil'VH  =  Of  (M8) 

where  gmi  =  gik  gfl  -  ga  gjkt  (1-1  9) 

correspond  in  ^5  the  4-quadrics 


—  or  rather  not  the  complete  4-quadrics  themselves  but  their  intersections 
with  the  e-quadric.  Here 


In  the  basic  system  the  matrices  Ra^  g^  have  the  forms 

(1-21) 


f  Equations  (1*16)  are  of  course  non-covariantive.   In  a  basic  coordinate  -system 
the  covariant  and  contra  variant  components  of  the  e-tensors  happen  to  be  the  same. 
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P  and  Q  being  symmetric  3x3  matrices  and  S*  the  transpose  of  the  3x3 
matrix  S  [3,  (3-13),  (3-28),  (3-29)].  The  tangential  equation  of  the  gr-quadric 
is  g"Ppapp  =  0,  g*P  also  being  given  in  the  basic  system  by  the  second  of  the 
matrices  (1-21).  It  should  be  noted  that  g^  and  gaP  are  both  equal  to  the 

6 

unit  6x6  matrix,  and  that  gaflp*pfl  =  £  (pa)2. 


To  the  dual  °22#  w  of  the  Riemann  tensor  corresponds  in  S5  the  4-quadric 
(envelope)  of  equation  °R*(i°p  <>p    =  Oj 

where  °R*f  = 


This  is  the  polar  of  the  4-quadric  Rap  with  respect  to  the  e-quadric.  Thus  the 
processes  in  $3  of  taking  dual  coordinates  and  of  taking  polars  with  respect 
to  the  fundamental  quadric  correspond  in  $5  to  taking  polars  with  respect 
to  the  e-  and  </-quadrics.  Each  of  the  latter  quadrics  is  self-polar  with 
respect  to  the  other  (3,  (3-21)  et  seq.). 

Thus,  as  stated  in  3,  we  have  in  $6  the  4-quadrics  ea/?,  </a/?,  jRay?,  from  the 
last  of  which  we  obtain  : 

(i)  its  e-polar  (envelope)  0R{tP\ 
(ii)  its  g  -polar  (envelope)  RaP\ 

(iii)  the  gr-polar  °R0ip  of  °RaP,  which  is  the  same  as  the  e-polar  of  RaP. 
That  is,  ,B^  _  e^efsRyS  =  g^xp*. 

The  Riemann  complex  may  be  classified  in  the  usual  way  by  means  of 
the  Segre  characteristic  of  the  matrix  pencil  [Rap  —  Aeay9]  (Jessop,  1903, 
pp.  230-232;  Zindler,  1922,  pp.  1128-1131,  1133).  Also,  as  pointed  out  in 
§4  of  3,  the  Segre  characteristic  of  the  pencil  [R^—^g^  has  a  similar 
significance  in  the  present  theory.  The  characteristics  are  called  the  e-  and 
^-characteristics  respectively,  and  the  roots  of  the  corresponding  character- 
istic equations  the  e-  and  gr-roots. 

2.    Change  of  basis 

Let  pa  =  (pl,  ...,p6)  continue  to  represent  the  basic  coordinate-system  in 
$5  obtained  by  an  arbitrary  choice  of  orthogonal  ennuple  h^  in  V±,  the  p* 
thereby  being  equal  to  the  ennuplet  coordinates  (p23,  ...,^>34)  of  a  line,  or 
more  generally  of  a  linear  complex,  in  $3.  Take  in  $6  the  new  coordinate- 
system  defined  by 

(P+P)>     X   =       (P+P)>     X   =    2(P+P«) 

(2<1) 
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or,  as  one  equation,  x*  —  K%P^»  (2*2) 

where  the  6x6  matrix  [Kp]  is  given  by 

'-^-^C-J]-  (2>3) 

By  (I'll)  the  ^'s  are  then  related  to  the  ennuplet  coordinates  in  83  by  the 
equations 


where  yjj 

Evidently  K  =  JK/-1  =  K*9  (2-4) 

so  the  transformation  inverse  to  (2-2)  is 

p*  =  KfaP. 
Under  this  transformation  the  tensor  Rap  becomes! 


=  K1RK     in  matrix  notation, 

=  \[P+Q  +  S  +  &     P-Q-S+Sn  (2-5) 

LP-Q  +  S-&     P+Q-S-S*] 

by  (1*21)  and  (2-3).  Likewise,  by  (1-16)  and  (2-21),  ea/?  and  gafl  become 


-/  O   / 

Thus  gap  is  invariant  under  the  transformation,  and  we  have 

=  (x1?  +  (x2)2  +  (x3)2  -  W?  -  (x5)2  -  (x6) 


W&tf  X-tf  =  S  (AT1)2  (2-8) 

a=*l 

for  the  e-  and  gr-quadrics.  Here,  by  definition, 

(±)«  =  +l     when     «=l,2or3,|  (g  g) 

=  —  1     when     a  =  4,  5  or  6.J 
The  suffix  of  (  +  )a  will  never  sum  unless  preceded  by  S. 

t  For  the  moment  the  new  coordinate-system  will  be  indicated  by  a  bracketed 
suffix  preceding  the  tensor.  This  cumbersome  notation  will  be  discarded  later. 
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From  the  point  of  view  here  adopted,  the  #'s  form  a  new  coordinate- 
system  in  S5,  but  they  may  also  be  regarded  as  pseudo-Klein  coordinates  of 
a  line  in  £3  [Jessop,  1903,  p.  20;  Hudson,  1905,  p.  46.  The  proper  Klein 
coordinates  are  (xac)  =  (x1,  #2,  X3,  —  ^->  —iX5>  ~~*^6)L 

A  change  of  basic  reference  -system  is  brought  about  by  a  rotation  of 
the  ennuple  A*  in  V^.  This  leads  to  an  orthogonal  transformation  of  co- 
ordinates in  $3,  which,  as  is  well  known,  induces  a  particular  kind  of  linear 
transformation  of  the  coordinates  %a  of  $5.  The  following  is  an  exact  formula- 
tion of  this  transformation-theory  in  the  form  needed  for  present  purposes. 
But  a  word  of  caution  is  required.  As  remarked  above,  if  the  fundamental 
quadratic  form  is  not  positive  -definite  at  (#*'),  some  vectors  of  the  basic 
ennuple  fej  are  purely  imaginary  if  the  vectors  define  real  directions  of  V±. 
This  concealment  of  the  signature  of  g^  is  made  in  order  that  all  cases  may 
be  treated  together  without  the  use  of  the  indicators  ea  (Eisenhart,  1926, 
pp.  36,  37)  which  specify  the  signature.  The  non-zero  ennuplet  components 
gab  of  the  fundamental  tensor  are  then  all  +  1  .  To  ignore  the  signature  in 
this  way  may  lead  to  mistakes  unless  care  is  exercised,  but  the  question  of 
signature  fortunately  does  not  seriously  arise  until  reality  -conditions  are 
considered  in  §4.  For  the  moment,  therefore,  it  suffices  to  say  that  the 
symbols  used  below  in  general  represent  complex  numbers,  and  that  for 
the  time  being  we  are  dealing  with  the  complex  rotation  group  (cf.  Cartan, 
1938,  vol.  1,  p.  11,  §8). 

If,  then,  A*,  A?  are  the  respective  coiitravariant  and  covariant  com- 
ponents of  an  arbitrary  orthogonal  ennuple  at  (x1)  in  V±  (hereafter  called  the 
initial  ennuple,),  the  equation  in  ennuplet  coordinates  Xa  =  h^Xi  of  the 
null  cone  at  (xl),  or  of  the  fundamental  quadric  in  $3,  is  as  usual 


=  0.  (2-10) 

A  rotation  of  the  ennuple  in  T^  is  defined  by  equations 

'Ai-fcAJ,     '*?-.***?,  (2-H) 

where  the  coefficients  lba  are  any  that  satisfy  the  equations 

(2-12) 


and  [L%]  is  the  matrix  reciprocal  to  [Zg]  (cf.  Ruse,  1937,  p.  102).  Thus  [Zg] 
is  a  4  x  4  orthogonal  matrix  and  its  reciprocal  [L%]  is  its  transpose.  It  is 
mainly  characterized  by  the  fact  that,  if 

'Xa  =  'hfX* 

=  L%Xb,  (2-13) 

SEE.  2.     VOL.  50.     NO.  2384.  F 
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then  'g^  =  g^  =  d^,  and  therefore 
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that  is,  the  quadratic  form  (2-10)  is  invariant.  In  S3  the  '^aare  coordinates 
having  as  tetrad  of  reference  the  points  whose  coordinates  in  the  non- 
ennuplet  system  are  'h^. 

The  matrices  [l%],  [L%]  are  further  characterized  by  the  requirement, 
indicated  in  (2-12),  that  they  be  unimodular.  We  are  thus  concerned  with 
the  group  of  rotations  in  the  strict  sense,  and  not  with  the  wider  group  of 
rotations -and -reflexions  in  which  the  determinant  of  the  transformation 
may  have  either  of  the  values  ±  1 . 

In  matrix  notation,  (2-13)  is 

'X  =  LX,  (2-14) 

where  £  =  [L%]  (a  row,  6  column). 

Instead  of  regarding  this  as  denning  a  change  of  coordinates  in  $3>  we 
may  regard  it  as  denning  a  direct  collineation  which  leaves  the  fundamental 
quadric  invariant.  Now  (Veblen  and  Young,  1918,  p.  336;  Klein,  1928, 
p.  Ill  et  seq.)  any  such  collineation  transforms  each  regulus  of  the  quadric 
into  itself,  and  is  the  product  of  two  collineations,  each  of  which  transforms 
one  of  the  reguli  into  itself  while  leaving  the  other  invariant.  Thus  L  in 
(2-14)  is  expressible  in  the  form 


L  =  iL 

or,  in  tensor  notation,      Zrg 
where  ^(«)  =  [^ 


2L(/?) 


.  2£g  =  JLf*  .  ^ 


A 


(2-15) 
(2-16) 

(2-17) 


(2-18) 


A   -A   -A 
-A       A       A   -A 

A  -A     A  -A 

A     A     A     AJ 

the  a's  and  /?'s  being  numbers  such  that 

Sa2  =  1,     S/?2  =  1. 

The  determinants  of  these  matrices  are  respectively  equal  to  (Sa2)2,  I 

so  both  have  the  value  unity.  Any  transformation  of  type  (2*17)  leaves  one 


(2-19) 
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regulus  invariant,  and  any  of  type  (2-18)  leaves  the  other  in  variant,  f  That 
any  matrix  ^L(a)  is  commutative  with  any  matrix  2£(/#)  is  geometrically 
obvious. 

An  -^-matrix  becomes  an  2  .L-matrix,  and  vice  versa,  if  the  signs  of  all  the 
elements  in  the  last  row  and  column  are  altered,  the  common  element  a4 
(or  /?4)  thereby  remaining  unchanged  in  sign. 

Suppose  in  particular  that  L  is  itself  of  the  form  iL(a).  Then  (2-15)  still 
holds,  2^(/?)  now  being  the  unit  matrix.  If  pab  is  a  line  (or  more  generally 
any  linear  complex)  in  the  initial  ennuplet  system  Xa,  it  transforms  into 


•p*  =  1£J 
or,  in  matrix  notation,         'p  =  ±L(cx,)  .p  .^(a),  (2-20) 

t  as  usual  denoting  the  transpose.  Substituting  from  (2-17)  and  then  intro- 
ducing the  variables  xa  denned  by  (2-1),  we  find  after  some  calculation  that, 
when  the  coordinates  Xa  in  S3  undergo  the  transformation  ^(ot,),  the  variables 
Xa  in  $5  undergo  the  transformation 


(2-21) 

o 

where  %  is  the  column-matrix  {xl,  .  .  .,  %6},  O,  I  are  as  usual  the  null  and  unit 
3x3  matrices,  and 

A(a)  -  ra!-haf-ai-a|      2(a1a2  +  a3a4)        2(a1a3-a2a4)  "]  .   (2-22) 

2(a1a2-a3a4)      a|  +  al-af-af      2(a2a3  +  a1a4) 
L  2(0^3  +  a2a4)        2(a2a3~a1a4)      a|  +  ag  -  a|  -  afj 


Thus  xl,  x2>  X3  undergo  a  linear  transformation,  while  'x*  =  X*>  'X5  =  X5> 
'XB  —  X6-  The  invariance  of  ^4,  ^5,  XB  shows  that,  in  S6,  the  conic  of  inter- 
section of  the  2-plane  ^4  =  0  =  xb  —  X*  with  the  e-quadric  is  invariant  under 
the  transformation.  This  conic  is  the  one  whose  points  correspond  to  the 
lines  of  the  regulus  in  S3  which  is  transformed  into  itself  by  the  collineation 
•iL(a)t  and  its  invariance  was  therefore  to  be  expected. 

If  we  put  x™  =  (X1,  X2,  X3},     #456  =  (X*>  X5,  A:6},  (2-23) 

t  This  well-known  result  follows  immediately  from  the  theory  of  two  -component 
spinora  (or  semi-spinors  in  the  terminology  of  Cartan,  1938).  If  the  coordinates  of  a 
point  on  the  fundamental  quadric  are  expressed  in  terms  of  such  spinors  (cf.  Ruse, 

1937,  pp.  103,  104),  then  linear  transformations  applied  to  each  spinor  separately  lead 
to  transformations  of  types  (2-17),  (2-18)  respectively.  The  possibility  of  factorizing 
a  proper  rotation,  or  rather  a  proper  Lorentz  transformation,  in  this  way,  was  made 
the  basis  of  the  definition  of  semi  -vectors  by  Einstein  and  Mayer  (1932  ;  also  Murnaghan, 

1938,  p.  358). 

F2 
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curly  brackets  denoting  column-matrices,  then  (2-21)  may  be  written 

'#123  =  A(a)  XlZB>        '#456  =  #456-  (2*23)' 

A(a)  is  an  orthogonal  matrix.  Thus 

A(a).A'(a)  =  / 

and  (V)2  +  ('A'2)2  +  ('X3)2  =  (A!1)2  +  (%2)2  +  (X3)2- 

Its  determinant  has  the  value  (£a2)3,  and  this  is  equal  to  unity,  by  (2-19). 
It  is  the  well-known  matrix  arising  in  the  theory  of  rotations  in  a  Euclidean 
3-space,  the  a's  being  Euler  parameters  (see,  e.g.,  Whittaker,  1927,  p.  8; 
Darboux,  1896,  Note  v,  p.  435;  Klein,  1928,  p.  102;  or  Murnaghan,  1938, 
p.  328).  In  the  language  of  Cartan  (1928,  vol.  2,  p.  61),  %123  and  %466  are 
semi-bivectors. 

If  instead  of  imposing  the  transformation  ^(a)  in  $3  we  now  impose  the 
transformation  2  .£(/?),  so  that  (2-19)  is  replaced  by 

'*  =  ,£</?). p., £>(/?), 
we  find  that  the  %'s  undergo  the  transformation 

'X=ri        0~[x  (2-24) 

10     A(y?)J 

instead  of  (2-21),  A(/?)  being  the  same  function  of  the  f?s  as  A(a)  is  of  the  a's. 
In  this  case,  therefore, 

'XlW  =  Xl23>        '#456  =  A(/7)  X456,  (2-24)' 

so,  as  we  should  expect,  %123  is  invariant  while  %456  undergoes  a  linear  (ortho- 
gonal) transformation. 

From  (2-21)  and  (2-24)  it  follows  that  any  transformation  L  =  ^(oC) .  2£(/?) 
in  $3  which  transforms  the  reguli  simultaneously,  each  into  itself,  brings 
aboxit  in  $6  the  transformation 


L  o     A(/?)_ 

(cf.  Cartan,  1938,  vol.  2,  p.  64,  (17)).  The  quadratic  forms  (x1)2  +  (x2)2  +  (#3)2 
and  (#4)2  +  (x5)2  +  (X6)2  are  separately  invariant,  and  hence,  by  (2-7)  and 
(2' 8),  the  e-  and  0-quadrics  are  invariant.  This  again  was  to  be  expected, 
the  '%'s  being  pseudo-Klein  coordinates  associated  with  the  new  ennuple 
defined  by  (2-11). 

Conversely,  since  any  3x3  unimodular  orthogonal  matrix  can  be  thrown 
into  the  form  (2-22),  the  a's  being  determinate  but  for  the  sign  of  the  set 
as  a  whole,  it  follows  that,  if  we  are  given  in  $5  a  transformation  which 
subjects  Xi2*  an(i  #456  to  separate  unimodular  orthogonal  transformations, 
then  it  can  be  expressed  in  the  form  (2-25)  and  the  corresponding  trans- 
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formation  ^Lfa) .  zL(fi)  in  $3  can  be  determined  but  for  sign.  The  corre- 
sponding new  basic  ennuple  'h*  is  therefore  also  determined  except  for  sign, 
that  is,  except  for  the  orientation  of  the  ennuple  as  a  whole. 

A  simple  type  of  orthogonal  transformation  to  which  #123  or  %456  may  be 
subjected,  and  one  which  will  be  of  importance  below,  is  that  ob tamed  by 
permuting  the  ^123  or  the  x456  among  themselves  (cf.  Hudson,  1905,  p.  118). 
For  present  purposes,  as  will  shortly  be  seen,  it  suffices  to  consider  permuta- 
tions of  x456  only,  which  lead  in  ^  to  rotations  of  type  2£(/?).  Consider,  for 
example,  the  transformation 

which  amounts  to  an  interchange  of  x5,  x6.  (The  minus  signs  are  inserted 
to  make  the  transformation  unimodular,  the  permutation  465  of  456  being 
odd.f)  Symbolically,  /  , 

For  this,  the  matrix  A(/?)  of  (2-24)  or  (2-24)'  is 

A(/?)=r-l       0       0-1, 
0       0-1 

L    o  -i     oj 

from  which,  by  (2-22)  (with  fi's  for  a's),  we  obtain 


(2-26)' 


Choosing  the  +  sign,  we  get  for  2L(fi)  the  matrix J 
-{465}:  Q4  =  7 


(2-27) 


0-11        0_ 
The  correspondinganatrices  for  the  other  odd  permutations  of  #450  are 


-{654}:                              ^5  =  J2 

"0 

-1 

0 

-r 

I 

0 

1 

0 

0 

-1 

0 

1 

1 

0 

-1 

0_ 

(2-28) 


f  All  three  of  the  x'3  have  been  given,  minus  signs,  but  alternatively  one  could  be 
given  a  minus  sign  and  the  other  two  plus.  A  reference  to  this  possibility  is  made  at 
the  end  of  §  2. 

J  The  —{465}  on  the  left  of  (2-27)  indicates  the  transformation  (2-26),  and  is  in- 
serted for  convenience  of  reference. 
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-{546}:  Q6 

The  even  permutation 
/ 

for  which 

gives  for  %L(p)  the  matrix 
+  {564}:  E5-- 
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l 

:V2 


00-1        1 

0     0-1-1 

1100 

-1100 
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(2-29) 


A</?)  =  ro    i    on  , 

0     0     1 

Li    o   oj 

'* 

~-i  -i     i     r 

1-1-1     i 

-i     1-1     i 

(2-30) 


Similarly  for  the  other  even  permutations,  including  456  itself,  we  have 


+  {645}: 


+  {456}: 


-1  -1  1  -1 
1  -1  -1  -1 
1  1  1-1. 

=  the  unit  matrix. 


(2-31) 


(2-32) 


The  transformations  (2-27)-(2-32),  or  rather  the  transformations  of  the 
%'s  which  lead  to  them,  will  be  referred  to  as  signed  permutations.^ 

Thus,  for  example,  if  we  start  with  a  basic  ennuple  h%  and  obtain  the 
corresponding  %'s,  then  the  transformation  (2-26)  in  S5  leads  to  the  trans- 
formation D4  in  £3,  and  hence,  by  (2-11)  and  (2-27),  to  the  new  orthogonal 
ennuple 

(2-33) 


If  the  coordinates  %123  are  permuted  instead  of  %466,  a  similar  set  of  matrices 
is  obtained,  this  time  of  type  iL(a).  They  are  obtained,  in  fact,  by  changing 


f  The  six  matrices  behave  like  the  quaternions  Q±  = 
&*  =  (i-j)l<j2,  E*  =  1»  ^5  =  -$(l-i-j-k),  EI 
Young,  1918,  pp.  337,  338. 


,  O5  =  (k  —  i)/*j2, 
Cf.  Veblen  and 
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the  signs  of  the  elements  in  the  last  row  and  the  last  column  of  each  of  the 
matrices  £2,  E  already  obtained.  For  example,  the  transformation 


for  which 

gives  for  ±1 
-{132}: 


A(«)  =  r- 


the  matrix 


1      r- 


0  1 

-1  0 


1       0 
0    -1 


-100 
0     1    -1 


(2-34) 


(cf.  (2-27)).  In  this  way  we  could  obtain  matrices  Q1?  Q2,  H3,  Elt  E2,  E3  of 
which  Elt  like  E±,  is  the  unit  matrix.  Now  ^  is  obtainable  from  H4  not  only 
by  changing  the  signs  of  the  last  row  and  column,  but  also  by  interchanging 
rows  2,  3  and  changing  the  sign  of  row  4.  Similarly,  D2  is  obtainable  from 
£15  by  interchanging  rows  3,  1  and  changing  the  sign  of  row  4;  and  so  on.  In 
each  case,  it  will  be  noted,  the  distribution  of  zeros  remains  unaltered. 
Likewise  E2  has  the  same  rows  as  EQ  and  E3  as  E5,  though  in  each  case  in  a 
different  order.  Thus  for  a  given  initial  ennuple  in  T£,  the  matrices  Ol9  ...,E3 
produce  the  same  sets  of  vectors  as  D4,  ...,^6  but  differently  ordered,  and, 
in  the  case  of  the  last  vector  of  each  set,  oppositely  directed.  For  example, 
^!  gives 

"h%  =  Qltg,     h\  =    ,   (7*|  +  fcj,  -h\- h\,  -h\  +  h%  h\ - h\),        (2-35) 


which  may  be  compared  with  (2-33).  In  so  far  as  orthogonal  ennuple  is  taken 
to  mean  an  ordered  set  of  four  mutually  perpendicular  unit  vectors,  this  is 
different  from  the  ennuple  (2-33).  But  in  so  far  as  it  is  taken  to  mean  a  set 
of  four  mutually  perpendicular  vectors  irrespective  of  their  order  and 
individual  orientations,  this  is  the  same  ennuple  as  that  defined  by  (2-33). 
The  latter  meaning  is  the  one  that  must  be  adopted  for  the  purposes  of  this 
paper:  in  $3  each  ennuple  represents  a  tetrahedron  self -polar  with  respect 
to  the  fundamental  quadric,  and  it  is  a  matter  of  indifference  in  which  order 
their  vertices  are  taken.  So  far  as  the  underlying^  is  concerned,  the  essential 
point  is  that,  given  a  basic  ennuple  at  (#*),  then  the  six  matrices  Q4,  ...,E6, 
or  alternatively  £119  ...,E3,  define  a  total  of  24  directions  which  fall  into 
6  orthogonal  sets  of  4.  Ennuples  which  are  composed  of  the  same  vectors  but 
in  different  orders  or  with  different  orientations  will  be  called  effectively 
the  same. 
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It  should  be  noted  that  the  product  of  any  two  or  more  of  the  whole  set 
of  twelve  matrices  12,  E,  also  defines  a  rotation,  which,  applied  to  the  basic 
ennuple,  produces  an  ennuple  effectively  the  same  as  one  of  the  six  already 
obtained.  Nothing  essentially  new  is  therefore  to  be  gained  by  considering 
the  signed  permutations  of  x123  as  well  as  those  of  %456,  the  matrices  Q4, . . . ,  E$ 
being  sufficient  by  themselves  to  define  the  24  directions  associated  with  a 
given  basic  ennuple. 

If  in  (2-26)  two  of  the  signs  had  been  taken  plus  and  only  one  minus,  and 
similarly  in  the  case  of  the  other  permutations,  effectively  the  same  ennuples 
would  again  have  been  obtained. 


3.    First  consequences  of  the  condition  of  self -polarity 

The  conditions  (1-4),  (1-6)  for  the  Riemann  complex  to  be  self -polar  of 
the  first  or  second  kind  with  respect  to  the  fundamental  quadric  become, 
in£6, 

R*fi=±°R*fit  (3-1) 

Either  of  these  equations  states  that  the  gr-polar  R^  of  the  4-quadric  Rap 
is  the  same  as  the  e-polar  °JBa^  of  the  same  4-quadric.  This  being  so,  assume 
the  (/-characteristic  to  be  [111111],  and  let  £  be  the  common  self -polar 
simplex  of  the  4-quadrics  JRa^  and  g^.  Take  the  polar  reciprocal  of  the  whole 
configuration  in  S6  with  respect  to  the  (/-quadric.  Then,  as  the  simplex  D, 
the  e-quadric  and  the  (/-quadric  itself  are  all  self-polar  with  respect  to  the 
(/-quadric,  they  reciprocate  into  themselves,  but  the  quadric  Rap  reciprocates 
into  R"P,  which,  by  (3-1),  is  the  same  as  QR*P.  So  21  is  likewise  the  common 
self-polar  simplex  of  the  (/-quadric  and  the  quadric  °R^.  Now  take  the 
polar  reciprocal  with  respect  to  the  e-quadric.  Then  the  (/-quadric,  the 
quadric  °RaP  and  their  common  self -polar  simplex  £  reciprocate  respectively 
into  the  (/-quadric  (which  is  self -polar  with  respect  to  e),  the  original  quadric 
Ra0  and  the  common  self-polar  simplex  of  the  g-  and  7?-quadrics,  i.e.  into  S. 
Thus  2  reciprocates  into  itself,  and  so,  when  the  Riemann  complex  in  S3  is 
self -polar  of  the  first  or  second  kind  with  respect  to  the  fundamental  quadric  and 
is  of  g -characteristic  [1 1 11 1 1],  the  common  self -polar  simplex  of  the  4-quadrics 
Ra0,  gap  in  $6  is  self-reciprocal  with  respect  to  the  4-quadric  e^. 

It  is  described  as  self -reciprocal  and  not  as  self -polar  because,  when  the 
complex  is  self -polar  of  the  second  kind,  the  simplex  is  not  self-polar  with 
respect  to  the  e-quadric  in  the  sense  of  having  each  vertex  the  polar  of  the 
opposite  face,  though  it  does  transform  into  itself.  This  may  be  seen  as 
follows.  Suppose  that  the  Riemann  tensor  satisfies  the  condition  (1*7)  for 
a  self -polar  complex  of  the  second  kind.  Choose  as  basis  the  ennuple  of 
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Ricci  vectors  at  (x*),  that  is,  the  set  of  four  vectors  Aj,  assumed  distinct, 
defined  by  the  Ricci  equation 

(^-/^)^'  =  0.  (3-2) 

In  $3,  (hi,  ...,hl)  are  the  vertices  of  the  common  self  -polar  tetrahedron  of 
the  Ricci  quadric  E^X1^  —  0  and  the  fundamental  quadric,  and  so,  with 
this  tetrahedron  as  frame  of  reference,  the  equations  of  the  two  quadrics 
reduce  to  sums  of  squares,  namely 


pa  being  the  roots  of  the  Ricci  characteristic  equation.  In  other  words,  if 
we  form  the  ennuplet  components  of  the  Ricci  and  fundamental  tensors, 
we  obtain 

Rab  =  diag  {p!,  p2,  p3,  p4},     gab  =  ^as- 

Expressing  (1-7)  in  ennuplet  form  by  writing  abed  for  ijkl,  we  then  find  that 
the  ennuplet  components  Babcd  of  the  Riemann  tensor  are  zero  except  when 
ab,  cd  are  the  same;  in  fact 


J?a/?  being  obtained  as  usual  by  replacing  suffix  -pairs  by  single  numerals  as  in 

4  4 

(1-12).  Also  by  (1-8),  which  may  be  written  2  Raa  —  0>  we  have  S  pa  =  0, 

a=l  a=l 

whence 

Raft  =  diag  {o-i,  <r2,  o-3,  -  crl9  -  cr2,  -  a-3),  (3-3) 

where  o\  =  —  4(p2+Ps)J  etc.  Thus  the  4-quadric  Raft  has  equation 

RaeP^^WY+^W  +  ^(P^z-o-i(p^-o-2(p^-crB(p^  «  0, 
while  by  (1-21)  and  (1'16),  the  g-  and  e-quadrics  are 

0 


The  equations  of  the  R-  and  (/-quadrics  are  thus  sums  of  squares,  and  the 
selected  coordinate-system  is  therefore  the  one  having  their  common  self- 
polar  simplex  as  frame  of  reference.  By  (3-4)  it  is  evident  that  each  of  the 
vertices  of  the  simplex,  for  example  px  =  (l,  0,  0,  0,  0,  0)  =  £f,  lies  on  the 
e-quadric,  a  fact  otherwise  obvious  because  the  vertices  are  the  points  of 
$5  corresponding  to  lines  of  $3,  namely,  the  edges  of  the  common  self  -polar 
tetrahedron  of  the  Ricci  and  fundamental  quadrics.  The  polar  4-plane 
of  the  vertex  $%  has  coordinates  e^d^  or  (0,  0,  0,  1,  0,  0).  Thus  the  common 
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self-polar  simplex  of  R^  and  g^  is  inscribed  in  the  e-quadric  in  such 
a  way  that,  if  its  vertices  are  numbered  1—6  and  the  opposite  faces  also 
numbered  1-6,  then  the  polar  reciprocals  of  the  vertices  1  ,  2,  3,  4,  5,  6  with 
respect  to  the  e-quadric  are  respectively  the  faces  4,  5,  6,  1  ,  2,  3,  and  vice  versa. 
If  /ilt  ...,/46  are  the  </-roots  of  the  self-polar  complex  of  the  second  kind, 
then,  by  (3-3), 


The  e-roots  A±,  .  .  .  ,  A6  are  respectively  i  times  the  </-roots,  and  hence  also 

=  0  =  A2  +  A5  =  A3  +  AG 


(of.  3,  end  of  §5).  The  complex  is  of  the  type  known  as  harmonic  (Jessop, 
1903,  p.  133).  It  is  beyond  the  scope  of  the  present  paper  to  consider  the 
special  cases  that  arise  when  there  are  equalities  among  the  Ricci  roots 

Pl,..-,P4- 

When,  however,  the  Riemann  complex  is  self  -polar  of  the  first  kind,  the 
common  self-polar  tetrahedron  of  the  4-quadrics  R^,  gap  is  also  self-polar, 
in  the  sense  of  having  each  vertex  the  pole  of  the  opposite  face,  with  respect 
to  the  e-quadric;  that  is,  for  a  self-polar  Riemann  complex  of  the  first  kind,  it 
is  possible  to  c/Mose  in  /S5  a  coordinate-system  such  that  the  equations  of  the 
^-quadrics  Ra^  </a/?,  £*$  all  reduce  to  sums  of  squares.  To  prove  this,  choose 
an  arbitrary  initial  ennuple  in  TJ  as  basis,  and  take  the  condition  of  self- 
polarity  in  the  form  (3-1)  (with  the  plus  sign).  Then 

ftap  =  ojj^  (3.5) 

that  is,  g^Ryrt'*  =  &vRyse?e, 

or  in  matrix  notation,  gRg  —  eBe, 

whence  by  (1-16)  and  (1-21),  since  g  is  the  unit  6x6  matrix, 


rp     an  =  ro     /"]  j"P     S~\  ro     I 
L&     GJ      U     oJljS1     QjU     C> 
-TO    /8T, 

U  -PJ 

and  therefore  P  =  Q  and  $  =  8*.  So  /S,  as  well  as  P  and  $,  is  a  symmetric 
matrix,  and  we  have:  for  a  self -polar  complex  of  the  first  kind,  the  Riemann 
4-quadric  in  S&,  referred  to  any  basic  coordinate-system  p«,  has  the  form 

R^=rP     ST\,  (3-6) 

[_S     Pj 

where  P,  S  are  symmetric  3x3  matrices. 
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It  follows  from  (2-5)  that,  if  we  now  transfer  to  the  pseudo-Klein  co- 
ordinates tf*  defined  by  (2-1),  then  RafJ  takes  the  form 

Ra/>=rP+S        0-1  (3-7) 

|_    O        P-SJ 

OH,  (3-8) 

v\ 

where  U—P  +  S,  V  =  P  —  S  are  symmetric  3x3  matrices,  while  ea/?  and  gap 
take  the  forms  (2-6).  The  R^  of  (3-7)  should  strictly  be  distinguished  from 
that  of  (3-6)  by  a  subscript  (%)  as  in  (2-5),  but  there  will  now  be  no  danger  of 
confusion  if  this  tensor  (and  similarly  ga^  eay?)  is  represented  by  the  same 
symbol  in  different  coordinate-systems. 

If,  as  we  have  been  assuming,  the  </  -roots  ofEa/3  are  all  different,  then  the 
latent  roots  of  U  and  V  are  all  different.  Hence,  as  in  the  ordinary  theory  of 
conies  in  a  plane,  it  is  possible  to  find  a  linear  transformation  Xi23~v//Vi23 
of  the  variables  p^1,  #2,  ^3  which  leaves  the  quadratic  form  (x1)2  +  (A!2)2  +  (A!3)2 
invariant  and  simultaneously  reduces  U  to  diagonal  form.  If  the  trans- 

formation is  /  A  /   \  /<>  r>\ 

Xi23  =  A(a)^i2a  (3-9) 

(cf.  (2-23)'),  we  thus  have 

A(a).U.  A'(a)  =  diag  {plt  ^  /*8}  s  Jf18a,  say,  (3-10) 

where  p,l9  /i2?  /*a  are  the  latent  roots  of  U.  The  orthogonal  transformation 
A(a)  is  unique,  inasmuch  as  the  common  self-polar  triangle  of  a  pair  of 
unrelated  conies  is  unique,  except  in  so  far  as  the  variables  ^123  may  be 
permuted  among  themselves  in  the  manner  discussed  in  §  2.  Likewise  an 
orthogonal  transformation 


can  be  found  which  reduces  V  to  diagonal  form:  thus 

^456*  say,  (3-12) 

latent  roots  of  V.  This  transformation  is  also  unique 
except  for  permutations  of  ^456.  Taking  (3-9)  and  (3-11)  together,  we  there- 
fore have  a  transformation  of  type  (2-25),  namely, 

'X  =  rA(«)        O  ~\X,  (3-13) 

L  O       A(/?)J 

such  that,  in  the  '^-system  of  coordinates  in  S5,  Rap  has  the  diagonal  form 

(3-14) 


Inl^,  as  seen  in  §  2,  there  is  a  corresponding  transformation  L  =  ^L(cC]  .  2L(ft) 
which  rotates  the  initial  ennuple  AJ  into  the  new  basic  ennuple  'h?  with 


92  H.  S.  RUSE  [16  Nov. 

respect  to  which  the  'xa  are  pseudo  -Klein  coordinates.  As  the  matrix 
[g*ji\  in  any  such  system  of  coordinates  is  the  unit  6x6  matrix,  it  follows 
that  /*!,  ...,/^6  are  the  g-roots  of  [Rap].  The  ennuple  rhf  will  be  called  a  prin- 
cipal ennuple  and  the  coordinates  '#a  associated  with  it  a  principal  system 
of  pseudo-Klein  coordinates  in  Ss. 

It  is  convenient  here  to  change  the  notation  and  to  represent  fh%  ,  'y* 
by  v",  m*  respectively.  We  then  have  the  following  result,  predicted  above: 

When  the  Riemann  tensor  satisfies  the  Einstein  condition  and  the  roots  of 
the  equation  det  |  R^  —  ^g^  \  =  0  are  all  different,  it  is  possible  to  choose  as 
basis  an  orthogonal  ennuple  v$  at  (x*)  in  such  a  way  that  g^^  pal>pcd,  eabcd  p^pcd, 
Rabcdpabpcd  are  each  reduced  to  sums  of  squares,  namely,  S(rna)2,  S(  ±  )a  (tna)2, 
S/£a(tt7a)2  [(  ±  )a  being  defined  by  (2-7)],  when  expressed  in  terms  of  the  pseudo- 
Klein  coordinates  defined  by  a  _  jr<t~p  (3-15) 


(cf.  (2-2)).  Here  {p*}  =  {p2*,p*1,  ...,£>34}  are  ennuplet  coordinates  with  v%  as 
basis,  and  are  thus  given  by  ab 


The  point-equation  of  the  fundamental  quadric  has  the  usual  ennuplet 
form  2  (Xa)2  ~  °>  fr°m  which  it  follows  (Jessop,  1903,  p.  100;  Hudson, 

1905,  pp.  32,  41,  47)  that  the  quadric  gtj  in  $3  is  one  of  the  ten  "fundamental 
quadrics"  with  respect  to  which  the  complex  is  self -polar — a  not  unexpected 
result.  As  we  shall  not  be  concerned  with  the  other  nine  quadrics,  the  term 
fundamental  will  still  refer  exclusively  to  the  quadric  gti. 
In  the  principal  coordinate-system  ma, 

det  |  Rap  -  Xe^  \  &  det  M128  -  A I  O         ,  (3-16) 

0          Jf456  +  A/ 

by  (3-14)  and  (2-6),  and  hence  the  e-roots  of  \Rap\  are 

/  "i  ~i\_/  \  /'^.17\ 

The  cyclic  condition        Rijld  4-  RiJai  +  R^k  =  0  (3-18) 

satisfied  by  the  Riemann  tensor  may  be  written 


In  S5  this  gives  ^R^  =  0,  (3-19) 

which  means  that  the  4-quadrics  ea/?,  Raft  are  apolar.  Taking  the  principal 
coordinates  w*  in  ^5  and  again  using  (3-14)  and  (2-6),  we  get  from  (3-19) 

/*i+^2+/%-^4-/%-^6  =  °»  (3*2°) 

that  is,  the  sum  of  the  e-roots  is  zero. 
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4.    The  six  sets  of  orthogonal  principal  directions 

It  was  remarked  above  that  the  principal  coordinates  'y?,  now  denoted 
by  wa,  were  unique  except  for  signed  permutations  of  TZ7123  and  tz7456.  As 
seen  in  §  2,  such  permutations  lead  to  rotations  of  the  basic  ennuple  in  T^, 
essentially  the  same  ennuples  being  obtained  from  the  signed  permutations 
of  Tj7123  as  from  those  of  TZ7456.  We  may  thus  confine  our  attention  to  the  latter. 
It  follows  that,  if  v®  are  the  covariant  components  of  the  principal  ortho- 
gonal ennuple  already  obtained,  five  other  essentially  distinct  principal 
ennuples  are  obtained  by  subjecting  it  to  the  rotations  lQ4,  ...}E6  defined  by 
(2-27), ...,  (2-31).  With  any  one  of  these  as  basis,  the  equations  of  the  </-,  e- 
and  ,R-quadrics,  in  the  corresponding  pseudo-Klein  coordinates  in  $5,  are 
sums  of  squares.  In  $3  the  contra  variant  components  of  these  ennuples 
are  respectively  the  coordinates  of  the  six  fundamental  tetrahedra  which 
are  self -polar  with  respect  to  the  fundamental  quadric  (cf.  Hudson,  1905, 
p.  42;  Lamson,  1930,  p.  713). 

Struik  (1927-8),  following  Kretschmann,  shows  how  the  Riemann  tensor 
may  be  used  to  define  15  sets  of  four  "principal  directions"  at  a  point  of  T^. 
His  work  is  discussed  below  in  §§  6,  8.  The  directions  defined  by  the  principal 
ennuples  obtained  above  are  six  of  Struik 's  sets  for  the  case  of  the  self -polar 
complex. 

When  the  fundamental  quadratic  form  at  (#*)  is  positive-definite,  all  of 
the  six  principal  ennuples  define  real  directions  of  T^.  For  in  that  case  all 
components  of  the  initial  ennuple  Af  are  real,  and  hence  the  ennuplet  com- 
ponents of  the  Riemann  tensor  are  real.  Therefore  P,  S  in  (3- (5),  and  con- 
sequently U,  V  in  (3-8),  are  real  symmetric  3x3  matrices,  so  the  orthogonal 
transformations  A(a),  A(/?)  of  (3-10)  and  (3-11),  which  reduce  them  to 
diagonal  form,  are  also  real  (see,  e.g.,  Ferrar,  1941,  p.  153).  It  follows  almost 
immediately  from  (2-22)  and  (2-19)  that  the  a's  and  /?'s  are  real.  Hence  the 
matrices  ^(a),  2-^(A)  defined  by  (2-17),  (2-18)  are  real,  and  therefore  their 
product  defines  a  real  rotation  in  V4.  The  product-rotation  transforms  the 
initial  ennuple  h®  into  the  principal  ennuple  vf,  which  is  therefore  real. 
Finally,  since  the  matrices  Q,  E  are  real,  so  are  all  the  six  principal  ennuples. 

Now  suppose  gy  at  (x*)  in  V±  to  be  of  signature  ( +  +  H — ).  (This  covers 
all  cases  of  one  minus  sign,  since  we  are  at  liberty  from  the  beginning  to 
number  the  coordinates  as  we  please.)  Then  in  the  initial  ennuple,  h{,  A|,  h\ 
are  real  and  h\  is  purely  imaginary.  Hence  the  ennuplet  Riemann  tensor 
Robed  is  rea/l  except  when  one  (and  only  one)  of  its  suffixes  is  4,  in  which  case 
it  is  purely  imaginary.  Therefore  all  the  elements  of  P  in  (3-6)  are  real  and 
all  those  of  8  are  purely  imaginary,  so  Z7,  V  in  (3-8)  are  complex  conjugates, 
and  the  matrix  of  the  transformation  (2-14)  is  therefore  Z/=1jL(a).2^(a*), 
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the  asterisk  denoting  the  complex  conjugate.  From  (2-17)  and  (2-18)  it  is 
immediately  obvious  that  L  is  of  the  form 

r    r    r     H,  (4-1) 

r     r     i-     i 

r     r     r     i 

i      i     i     r 

where  "r"  denotes  "real"  and  "i"  "purely  imaginary".  Since,  with  the 
same  symbolism,  ro_./iLi  JA\  _  /•  -\ 

it  follows  that  the  rotation  (4-1)  applied  to  h®  produces  a  principal  ennuple 
v%  which  is  also  of  type  (r,  r,  r,  i)  and  which,  therefore,  defines  a  set  of  four 
real  directions  in  V^.  But  by  (2-27), .. .,  (2-31)  it  is  obvious  that  the  rotations 
£24,  H6,  O6,  jEr5,  EQ  applied  to  v%  do  not  produce  ennuples  of  the  same  type. 
So  the  remaining  five  principal  ennuples  do  not  define  real  directions  of  V^ 
and  we  are  thus  able  to  conclude  that,  when  the  Riemann  tensor  satisfies  the 
Einstein  condition  and  the  signature  of  gtj  is  +  2,  only  one  of  the  six  principal 
orthogonal  ennuples  defines  real  directions  of  V±  (cf.  Lamson,  1930,  p.  721). 

The  same  conclusion  holds  if  the  signature  is  —2  [case  (— * H)].  From 

the  point  of  view  of  relativity  the  existence  in  empty  space-time  of  a  unique 
set  of  real  principal  directions  intimately  connected  with  the  Riemann 
tensor  is  not  without  interest,  especially  in  view  of  the  non-existence  of 
Ricci  principal  directions.  It  suggests  that,  at  every  world-point  at  which 
the  momentum -energy  tensor  is  zero,  there  exists  a  coordinate -system  which 
is  mathematically,  if  not  physically,  to  be  preferred.  It  will  be  shown  in  a 
later  paper  that  the  principal  directions  defined  for  a  gravitational  field  by 
A.  G.  D.  Watson  (1941)  are,  in  fact,  the  same  as  those  obtained  above. 

If  the  signature  is  ( +  H ),  which  covers  all  cases  of  two  minus  signs, 

no  definite  statement  can  be  made  about  the  reality  of  the  principal  direc- 
tions. For  suppose  as  usual  that  the  initial  ennuple  h%  defines  real  directions. 
It  is  then  of  type  (r,  r,  i,  i),  and  P,  St  U,  V  are  all  of  type 

r     r     il.  (4-2) 


Suppose,  as  a  particular  case,  that  the  imaginary  elements  of  U,  V  happen 
to  be  zero,  so  that  they  are  both  of  the  type 

r     01.  (4-3) 

r     0 
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Then  they  are  both  real  and  symmetric,  and  the  matrices  A(a),  A(/?)  which 
reduce  them  to  diagonal  form  are  therefore  also  real  and  have  zeros  dis- 
tributed as  in  (4-3).  By  (2-22),  (2-17)  and  (2-18)  it  at  once  follows  that 
17/(a),  2£(/?)  have  one  or  other  of  the  forms 

(4-4) 


(4-5) 


They  may  both  be  of  form  (4-4)  or  both  of  form  (4-5),  or  one  may  be  of  form 
(4-4)  and  one  of  form  (4-5).  In  each  case  their  product  L  also  has  one  of 
these  forms.  Applied  to  the  initial  ennuple  (r,  r,  i,  i),  L  therefore  leads  to  a 
principal  ennuple  either  of  type  (r,  r,  i,  i)  or  of  type  (i,  i,  r,  r).  Of  these  the 
former  defines  real  directions  in  V4,  but  the  other  principal  ennuples  obtained 
from  it  by  applying  the  rotations  Q,  E  do  not  yield  real  directions.  The  latter 
does  not  itself  define  real  directions,  but  the  rotation  O6  (see  (2-29))  makes 
it  (r,  r,  i,  i)  and  so  produces  an  ennuple  which  does  define  real  directions.  So 

in  the  case  ( +  H )  there  may  be  a  set  of  real  principal  directions. 

On  the  other  hand,  there  may  be  none.  For  suppose  that  there  were  always 
such  a  set,  defined  by  a  principal  ennuple  v^  =  (r,  r,  i,  i).  With  this  as  basis, 
U  and  V  would  both  have  diagonal  form  with  real  elements,  as  follows 
from  (4-2)  with  all  but  the  diagonal  elements  zero,  and  these  real  elements 
would  be  the  respective  latent  roots  of  the  matices  U,  K  But  the  latent 
roots  of  symmetric  (not  Hermitian)  matrices  of  type  (4-2)  are  not  in  general 
all  real,f  so  there  cannot  in  general  be  a  principal  ennuple  defining  real 
directions  of  V±. 

5.    Equal  g -roots :  example  of  the  Schwarzschild  space-time 
It  has  been  assumed  throughout  that  the  g-roota  filt  ...,{ie  are  all  different. 

This  restriction  may  now  be  removed,  but  it  must  still  be  assumed  that  the 

elementary  divisors  of  [Rap  —  ju>gap]  are  simple. 

In  §  3  the  uniqueness  of  the  transformations  A  (a),  A(/?)  (but  for  signed 

permutations  of  the  principal  coordinates  H7123  or  T?7456)  merely  required  the 


f  For  example,  if  all  the  real  elements  are  zero  and  the  purely  imaginary  ones 
non-zero,  one  latent  root  is  zero  and  the  other  two  are  purely  imaginary. 
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assumption  that  /*123  =  {/*1?  fa,  ^  and  /£456  =  {/*4>/*6>/*6}  were  ftU  different. 
It  did  not  require  that  the  whole  set  of  six  numbers  plt . . . ,  /£6  should  be  all 
different.  Therefore  the  uniqueness  is  unaffected  if  one  or  more  of  /^m  are 
respectively  equal  to  one  or  more  of  /^456,  provided  that  no  two  numbers 
of  the  same  triad  are  equal.  So  the  transformations  remain  unique,  but 
for  the  signed  permutations,  and  the  number  of  sets  of  principal  orthogonal 
directions  is  still  six,  when 

either  (i)          /^  =  //4    with  the  /^'s  otherwise  unequal; 

or  (ii)  /*1=/*4,  /*2=/*5>  /^3  =  /*6  (/h,/*a»/*8une(lua1)- 
This  is  exhaustive.  For  by  (3-20)  we  cannot  have  /^  =  /*4  and  /i2  —  /45 
without  also  having  /^3  =  /^6,  while  a  possibility  such  as  /£t  =  /£4,  /£2  =  /^> 
fa  =3  ^5  is  taken  care  of  by  applying  the  signed  permutation  'ti7456  =  —  Tzr465 
to  case  (ii).  The  ^-characteristics  in  these  two  cases  are  respectively 
[(11)1111],  [(11)  (11)  (11)].  Since  by  (3-17)  the  e-roots  are  (/4123,  -^456), 
the  e- characteristic  in  each  case  is  in  general  [111111],  but  has  a  more 
specialized  form  if  any  of  the  equal  /^'s  are  zero  or  if  any  of  /^123  are  equal  to 
anyof-/*456. 

If  any  of  /£123  are  equal,  there  are  an  infinite  number  of  matrices  A(a) 
which  reduce  U  in  (3-8)  to  diagonal  form.  For  example,  if  /£2  =/£3^/*1, 
then,  interpreting  ^123  momentarily  as  coordinates  in  a  protective  plane, 
the  conies  ^1213^123  =  0,  %i23Xi23  —  0  have  double  contact  and  therefore 
have  an  infinite  number  of  common  self -polar  triangles.  Similarly,  if  /^466 
are  not  all  different,  there  are  an  infinite  number  of  matrices  A(/?)  which 
reduce  V  to  diagonal  form.  An  example  is  provided  by  the  Schwarzschild 
space-time  of  relativity,  for  which 

cfc?2  =  •^rp-ldr2-r2d62-r2sm20d(f)2+pr-1dt2     (p  =  r-  2m,  m  const.). 
The  non-zero  distinct  components  of  the  Riemann  tensor  are 

#2323  =  -  2mr  Sin2  # >       ^3131  =  mP~1  sin2  Q>       ^1212  =  mp~l, 

J?1414  =  2mr~3y  ^2424  —  ~~  w/o^*~2j  #3434  =  ~  mpr~2  sin2  0. 

Taking  as  basic  ennuple 

h{=si(p/r)*,      0,  0,  0; 

fc|s      0,         i/r,  0,  0; 

hi-     0,          0,      i/(rsinO),        0; 

hl^     0,          0,  0,  (r/p)*, 

the  covariant  components  of  which  are  obtained  by  replacing  the  non-zero 
elements  by  their  reciprocals,  we  obtain  the  following  ennuplet  components: 

#2323  —  ~  2W//3  =  #1414,       J?3131  =  m/l*  =  RIZ12  =  #2424  =  #3434- 


(5-1)- 
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Hence  the  matrix  Raj3  of  (3-6)  is  in  this  case  already  hi  diagonal  form  (of. 
Watson,  1941,  p.  16)  and  (5-1)  is  a  principal  ennuple.  We  have  in  fact 

Rap  =  mr~3diag{-2,  1,  1,  -2,  1,  1},  (5-2) 

so  the  8  and  P  of  (3-6)  have  the  values  S  =  0  and  P  =  mr-3diag{-2, 1, 1}. 
Hence  the  Ut  V  of  (3-8)  are  both  equal  to 

vn 

--2     0     OH,  (5-3) 

0     1     0 
0     0     1J 

=  —  2m /r3,  JLCZ  =  /£3  =  /*5  =  /*6  =  m/r3.  If 
A(a)^fl  0  01,  (5-4) 

0       cos  2a     sin  2a  I 
LO   —  sin  2a     cos  2aJ 
a  being  any  angle,  then         A(a) .  U.  A'(ct)  =  U, 

that  is,  U  is  invariant  under  the  transformation  A(a).  Similarly,  F  is  in- 
variant under  any  transformation  A(/?),  /?  being  any  angle.  With  A(a) 
given  by  (5-4),  the  a's  of  (2-22)  have  the  values 

ocj  =  sui  a,     a2  =  0  =  a3,     a4  =  cos  a 
(or  minus  these).  Similarly,  for  A(/?), 


and  the  gr-roots  are  therefore  /^  = 


Substituting  in  the  formulae  (2-17)  and  (2-18)  for 
plying,  we  obtain  in  T^  the  rotation 


L  — 


cos  (a  —  /?) 
0 
0 
_-sin(a-/?) 


0 
cos  (a 


0 


0 


r(a),  %L(ft),  and  multi- 
sin  (a-  /?)"].       (5-5) 

0 

0 
cos  (a—  /?)_ 


This,  apph'ed  to  the  covariant  components  of  the  ennuple  (5-1)  (see  (2-11)) 
produces  a  new  ennuple,  which,  taken  as  basis  for  pseudo-Klein  coordinates, 
reduces  Ra/)  to  diagonal  form  whatever  the  angles  a,  ft.  The  principal  ennuple 
(5-1)  is  therefore  unique  except  in  so  far  as  the  symmetry  properties  of  the 
Schwarzschild  V4  permit  this  particular  type  of  rotation. 

If  the  elementary  divisors  of  U  —  ju,I  or  V  —  pI  and  hence  of  [Rap  —  WJaf!\> 
are  not  simple,  then  U  or  V  are  not  reducible  to  diagonal  form  by  orthogonal 
transformations,  and  there  are  no  principal  directions  in  the  sense  defined 
above. 
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6.    The  fifteen  principal  tetrahedra  in  $3 

The  geometrical  background  of  the  preceding  analysis  is  so  well  known 
that  only  a  brief  account  of  it  need  be  given  here. 

A  general  quadratic  complex  defines  in  $3  a  set  of  30  lines  which  are  the 
edges  of  15  tetrahedra.  These  have  a  total  of  60  vertices  and  60  faces,  the 
whole  forming  a  Klein's  6015  configuration  (Hudson,  1905,  pp.  41,  42; 
Jessop,  1903,  p.  37).  Of  the  15  tetrahedra,  9  are  inscribed  in  any  one  of  the 
]  0  fundamental  quadrics  and  the  remaining  6  are  self-polar  (Hudson,  ibid. 
p.  42).  The  6  that  are  self -polar  with  respect  to  the  quadric  g^  define  the 
6  sets  of  principal  orthogonal  directions  discussed  above. 

The  existence  of  the  15  principal  tetrahedra  lies  behind  the  work  of 
Struik  (1927-8)  and  Churchill  (1932).  Struik,  dealing  with  the  general  case 
when  the  Riemann  tensor  does  not  necessarily  satisfy  the  Einstein  con- 
dition, defines  15  sets  of  4  "principal  directions",  which  are,  in  fact,  the 
directions  in  V4  corresponding  to  the  vertices  of  the  15  tetrahedra  in  $3. 
He  makes  only  passing  reference  to  the  geometry.  Churchill,  dealing  with 
the  self-dual  part  G^M  of  the  Riemann  tensor,  obtains  in  his  §  13  six  "elemen- 
tary six-vector  skeletons".  Such  a  skeleton  is  a  set  of  six  simple  bivectors, 
which,  in  S3,  represent  the  six  edges  of  one  of  the  six  self -polar  tetrahedra. 

The  existence  of  the  30  lines  is  obvious  from  the  5 -dimensional  repre- 
sentation. For,  in  !S5,  the  common  self-polar  simplex  of  the  4-quadrics 
Rap  and  ea/3  has  6  vertices.  The  lines  joining  these  are  15  (  =  BC2)  in  number, 
and  each  of  them  cuts  the  e-quadric  in  two  points  corresponding  to  lines 
of  $3.  That  the  30  lines  form  15  tetrahedra  is  less  obvious,  but  is  easily 
arguable  in  terms  of  the  geometry  either  of  S5  or  of  $3.  In  the  latter  space 
they  are  the  pairs  of  directrices  of  the  linear  congruences  obtained  by  taking 
in  pairs  the  linear  complexes  corresponding  to  the  vertices  of  the  self-polar 
simplex  in  $5.  All  this  is  so  well  known  that  no  further  discussion  is  needed 
here,  §§  7  and  8  below  being  devoted  to  working  out  the  analytical  conse- 
quences of  the  geometry  more  completely  than  is  done  by  Struik  or  by 
Churchill.  The  latter's  treatment,  being  4-dimensional,  rather  obscures 
the  connexion  of  his  theory  with  these  classical  elementary  properties  of 
quadratic  complexes. 

7.    The  principal  bivectors 

What  may  be  conveniently  called  a  principal  e-bivector  at  the  point 
(#*)  of  V±  is  any  skew  tensor  qij  that  satisfies  the  equation 

</20H-A<to)g«  =  0  (7-1) 

for  some  value  of  A.  When  the  e-roots  A1?  ...,A6  are  all  different,  which 
we  assume  from  now  on,  there  are  six  such  bivectors,  each  one  being 
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determinate  but   for  a  scalar   factor    of  proportionality.  In   $5,    (7-1) 

becomes                                     /n        ,       .     ..       .  ._  _. 

(Ra/}-Xea/i)qfi=Ot  (7-2) 


which  states  that  the  point  q*  has  the  same  polar  4-plane  with  respect  to 
the  4-quadrics  Rap  and  ea/?.  The  six  points  qa  thus  obtained  are  in  fact  the 
vertices  of  the  common  self-polar  simplex  of  the  4-quadrics.  In  $3  the  qij 
define  the  fundamental  linear  complexes  of  the  Riemann  complex  (Jessop, 
1903,  p.  26;  Hudson,  1905,  p.  33). 

A  principal  g-bivector  r^  is  likewise  defined,  except  to  the  extent  of  a 
scalar  factor,  by  the  equation 

(*0«-W«H)»*  =  0,  (7-3) 

or,  in  8,,  (Raf  -  ftgxff)  r"  =  0,  (7-4) 

[A  being  one  of  the  six  {/-roots,  also  assumed  all  different.")*  The  six  points 
ra  in  $5  are  the  vertices  of  the  common  self  -polar  simplex  of  the  4-quadrics 

R<xfl>  ffafi- 

It  was  seen  in  §  3  that,  when  the  Riemann  complex  is  self  -polar  of  the 
first  kind,  the  Riemann  tensor  thereby  satisfying  the  Einstein  condition 
at  (x1),  the  two  simplexes  qa  and  ra  are  the  same,  and  that,  when  this  common 
self  -polar  simplex  is  taken  as  frame  of  reference  for  coordinates  wa  (the 
"principal  coordinates")  in  S5,  then 

23       O  I,     ea,  =  ri       O  I,     ffafl  =  ri     01,        (7-5) 

j/45J  lo    -/J  \_p   /J 

where  Jf123  =  diag^,^,/^},  M^  =  diag{^4,/*5,/^6},  the  /^'s  being  the 
0-roots.  The  e-roots  are  (/*i,/*2»/*3»  ~/^4»  ~  fi&  —fa)-  ^°  in  ^nis  case  we  have 
one  set  of  six  points  <?£)=  (qfi),  ••.,  <?(e))  such  that,  by  (7*2)  and  (7-4), 

Rafitf*)  =   (  ±  )K^^  =  H*Qa&ft  (7>6) 

(not  summed  for  K),  where,  as  in  (2-9),  (  +  )K  =  1  if  AC  =  1,  2  or  3,  and(  ±  )K  =  ~  1 
if  AC  =  4,  5  or  6.  In  $3,  and  hence  V^  these  give  the  equations 

tfy«0g)  =  H  ±  )K/*KeiW$)  =  tMvkiVM  (7'7) 

satisfied  by  the  principal  linear  complexes  (bivectors).  The  last  equality 
may  be  written 


.     . 

(  ±    ) 


f  The  existence  in  a  general  Riemannian  space  of  two  distinct  sets  of  principal 
bivectors  is  a  peculiarity  of  F4,  in  which  the  e-tensors  share  with  the  Riemann  tensor 
the  property  of  having  four  suffixes.  A  Vn  (n  =fc  4)  has  principal  gf-bivectors  but  not 
principal  e  -bivectors. 

G2 
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which  means,  in  $3,  that  the  linear  complexes  qfy  are  self-polar  with  respect 
to  the  fundamental  quadric,  and,  in  T£,  that  the  three  bivectors  for  which 
K  =  1,  2,  3  are  self-dual  and  the  other  three  anti-self-dual. 

As  the  q*K)  in  $5  are  the  vertices  of  the  simplex  of  reference  of  the  principal 
coordinate-system  ma,  we  have,  in  that  system,  by  a  suitable  choice  of  the 
factor  of  proportionality, 

gg)  =(1,0,  0,0,  0,0),..., 
or,  as  one  (non-covariantive)  equation, 

<?(%  =  e  (7-8) 

From  this  and  (7-5)  follow  at  once  the  covariantive  equations 


(no  summation  for  /c).  It  will  be  seen  that  these  are  in  agreement  with  (7-6). 
For  K^O~,  these  equations  express  the  conjugacy  of  the  six  points  q*K)  in 
$6  with  respect  to  each  of  the  4-quadrics  Ra^  ea>?,  g^.  In  T^  they  give  the 
relations 


satisfied  by  the  principal  bivectors.  For  K=£CT,  (7-13)  states  that  the  linear 
complexes  q^  in  $3  are  mutually  in  involution;  (7-14)  that  each  is  in  in- 
volution with  its  polar  q^j  with  respect  to  the  fundamental  quadric;  and 
(7-12)  that  each  q^  is  in  involution  with  the  polar  complex  Rijklqfy  of  any 
one  of  the  others  with  respect  to  the  Riemann  complex. 
,  by  (7-5)  and  (7-8), 


=  2 

/c-l 


from  which  we  obtain:  //  q^  (K  =  1,  ...,  6)  are  the  covariant  components, 
suitably  normalized,  of  the  principal  bivectors  defined  by  the  Riemann  tensor 
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at  a  point  of  V^  at  which  the  Einstein  condition  is  satisfied,  and  if  the  g-roots 

^ 

S  /« 

K 

E  (  ±  ) 


are  all  different,  then      „  ^  ,.  ,  0. 

JJ  RUM  =  S  /«Gc>tftffe>«»  (7*18) 


(7*2°) 

K 

These  relations  may  also  be  established  as  follows.  The  formula  giving 
the  Riemann  tensor  in  terms  of  the  EajS  of  $5  is 


yfj  being  defined  as  in  (1-13),  with  similar  relations  for  ei:fkl  and 
observed  in  §  1  ,  a,  /?  may  be  taken  to  relate  to  any  coordinate-system  in 
$5  provided  that  yjy  is  treated  as  a  tensor  of  the  type  indicated  by  its 
suffixes.  If  in  $5  we  take  the  principal  coordinates  w",  then,  by  (7-5),  (7-21) 
becomes  o 

=  S 


with  similar  relations  for  e^w  and  g^^  Comparison  with  (7-18),  (7-19), 
(7-20)  shows  that  the  two  sets  of  equations  are  the  same  if  this  particular 
yfj  is  identified,  as  it  must  be,  with 


8.    The  6  +  9  sets  of  principal  directions 
Taking  any  system  of  coordinates  X*  in  $5,  define  lfK)  by  the  relations 


v 

%  =  ^  (9g)  -  ^4)),        %)  =  ^2  (^("2)  -  9?5))>        ^(a6)  =  ^  (S&  ~  2(6)) 

[cf.  Struik,  1927-8,  eqns.  (11),  (12);  Churchill,  1932,  eqns.  (13-2)],  which, 
by  (2-2)  and  (2-4),  may  be  written 

Jfo  =  Kl$ry  (8-2) 

By  (7-10)  all  the  I*B  satisfy  the  equation 

6afll*V  =  0,  (8-3) 

so  they  all  represent  points  on  the  e-quadric  in  S5.  Zg)  and  lf$  are  thus  the 
points  where  the  line  joining  the  vertices  gg),  qfo  of  the  common  self-polar 
simplex  of  the  4-quadrics  meets  the  e-quadric,  and  similarly  Ify,  Ifo  and 
'§)»  %)  are  points  on  the  edges  25,  36  of  the  simplex.  The  relations  (8-1) 
Trill  be  symbolized  by  ;  =  ?(1  ±4>2±  5i3±6))  (8.4) 
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the  upper  signs  referring  to  the  first  line  and  the  lower  to  the  second  line 
of  (8-1). 

By  (7-10),  ^^  =  0,     |*-<r|*3,  (8-5) 

=  1,     |*-(r|=3.  (8-6) 

These  equations  mean  that  each  point  lfK)  in  $5  is  conjugate  with  respect 
to  the  e-quadric  to  all  the  others,  with  the  exception  of  the  point  Zj±8)  on 
the  same  edge  of  the  simplex  as  itself. 


In  Sa,  (8-3)  becomes  iew  W  =  0,  (8-7) 

expressing  the  otherwise  obvious  fact  that  the  1$  are  the  coordinates  of 
lines  in  $3  and  not  of  non-special  linear  complexes;  and  (8-5)  becomes 


which,  for  K  —  <r,  is  the  same  as  (8-7),  and  for  AC=t=cr  states  that  each  of  the 
lines  Ify  meets  four  of  the  others  but  not  the  fifth  (namely,  the  one  for  which 
|  K  —  or  |  =  3).  So  the  six  lines  are  the  edges  of  a  tetrahedron  in  $3.  Moreover, 
since  by  (7-11)  and  (8-1), 

Qapk^W  —  °K<T» 

which,  for  /c=J=er,  states  that  each  point  lfK)  in  S5  is  conjugate  to  every  other 
with  respect  to  the  {/-quadric,  we  have  in  $3 


which  may  be  written      to^^g/g)  =  0     (K  4=  o-),  (8-8) 

=  2     (ic  =  a).  (8-9) 

Equations  (8-8)  mean  that  each  edge  ^  of  the  tetrahedron  meets  the  polar 
of  every  other  edge  with  respect  to  the  fundamental  quadric,  and  (8-9) 
that  it  does  not  meet  its  own  polar.  Hence  the  tetrahedron  is  self  -polar  with 
respect  to  the  fundamental  quadric.  If,  then,  v£=(t?j,  ...,v|)  are  the  four 
vertices  of  the  tetrahedron,  the  polar  plane  gr^vj  of  any  one  of  the  vertices 
contains  each  of  the  other  three  vertices,  and  so 


If  the  v's  are  normalized  so  as  to  make  the  left-hand  side  equal  to  unity 
when  a  =  6,  which,  by  (8-9),  is  consistent  with  taking  the  Pliicker  coordinates 
of  the  edges  to  be  exactly 
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and  not  scalar  multiples  of  these,  we  have 

WM^Sa*,  (8'n) 

and  the  t>'s  therefore  constitute  an  orthogonal  ennuple  at  (xl)  in  14. 

Take  vj  as  basic  ennuple  in  V4.  The  ennuplet  components  of  the  Riemann 
tensor  are  then  ^ijkivavlvcvd^  which,  because  of  the  skewness  of  R^w  in 
i,  j  and  k,  I,  is  identically  equal  to 


Replacing  suffix-pairs  by  single  numerals  in  the  usual  way,  it  is  at  once 
evident  from  (8-10)  that  the  equation  of  the  J?-quadric  in  $5  in  the  basic 
coordinate-system  p*  associated  with  v^  is 

RarP^P7  =  0, 

where  Rffr  =  IRW1&1%} 

by  (8-2) 


=  S  P  ?££&(     by  (7-12), 
P-I 

n 

whence  RarPaPT  =  2 

/3-1 

by  (3-15),  rzT/'  being  the  pseudo-Klein  coordinates  associated  with  vj.  The 
fact  that  the  quadratic  form  on  the  right  of  (8-12)  is  a  sum  of  squares  with 
coefficients  ju,p  shows  that  the  w?  are  the  principal  coordinates  for  $5  found 
in  §3,  and  therefore  that  vj,  as  denned  by  (8-10),  is  the  principal  ennuple 
associated  with  them. 

Five  other  principal  ennuples  were  obtained  in  §  4  by  imposing  signed 
permutations  upon  the  w's.  These  are  also  obtainable  by  taking  sets  of  edges 
of  the  self  -polar  simplex  in  $5  other  than  the  edges  14,  25,  36  of  (8-1)  or 
(8-4).  They  arise,  in  fact,  if  the  qfa,  gfa,  qfa  of  (8-1)  are  subjected  to  exactly 
the  same  signed  permutations  as  r<74,  wb,  WQ.  So,  for  example,  the  ennuple 
derived  from  the  signed  permutation  —  {465}  of  (2-26)  and  (2-27)  is  obtained 
if,  in  (8-1),  we  replace  gfa,  gf5),  gfa  respectively  by  -gf4),  -qfa,  -qfa.  In  this 
way  we  obtain,  in  the  symbolic  notation  of  (8-4),  the  following  five  other 
ways  of  "defining  the  Z's  in  terms  of  the  g's: 

I  =  g(l  +  4,2  +  6,3  +  5), 

I  =  0(1*  6,  2  +  5,  3  +  4), 

£  =  2(1  +  5,2  +  4,3  +  6), 

I  =  </(!  +  5,  2  ±6,  3  ±4), 

1  =  ?(1  ±6,  2  ±4,  3  ±5). 

These  correspond  respectively  to  the  transformations  (2-27),  ...,  (2-31), 
and  yield  exactly  the  same  principal  orthogonal  ennuples  as  before. 
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The  six  formulae  for  the  Va  in  terms  of  the  #'s  are  characterized  by  having 
the  suffixes  1,  2,  3  of  the  g's  paired  in  all  possible  ways  with  4,  5,  6,  but  never 
with  one  another.  If  1,  2,  3  are  allowed  to  pair  with  one  another  as  well  as 
with  4,  5,  6,  we  obtain  the  complete  set  of  15  ways  of  pairing  the  suffixes 
of  the  q'&  into  sets  of  three,  namely  the  six  given  above  and  nine  others. 
Consider,  for  example,  the  effect  of  taking  the  combination  14,  23,  56. 
Define  the  Ts  by  the  formulae 

(8-13) 


The  factor  +  i  appears  before  one  of  the  g's  whenever  two  of  the  suffixes 
1,  2,  3  or  two  of  4,  5,  6  appear  together,  and  the  signs  are  chosen  so  as  to 
make  (8-13)  consistent  with  (8-15),  (8-16)  below.  As  before, 


so  the  corresponding  lines  of  SB  again  form  a  tetrahedron.  This  tetrahedron 
is  self-reciprocal  with  respect  to  the  fundamental  quadric,  but  not  in  the 
sense  that  each  vertex  is  the  pole  of  the  opposite  face.  For  by  (7-6)  we  now 

£•        1ft          if        is  

=   — t,~fivfiA        11        t   —    • 


which  express  the  fact  that,  in  $3,  the  lines  ZjJ  for  K  =  2,  3,  5,  6  are  self- 
polar  with  respect  to  the  fundamental  quadric,  that  is,  that  they  are 
generators.  Also  it  follows  from  (8-14)  that  generators  2,  5  meet  each  of 
3,  6  but  do  not  meet  one  another,  so  2,  5  belong  to  one  regulus  and  3,  6  to  the 
other.  Hence  the  lines  ZjJ>  for  /c  =  2, 3, 5, 6  form  a  skew  quadrilateral  on  the 
fundamental  quadric,  and  the  remaining  lines  (K  =  1,4)  are  its  diagonals. 
The  vertices  u\  of  this  quadrilateral,  denned  by 

Zg  =  ulu33-uiul  etc.,  (8-15) 

thus  lie  on  the  fundamental  quadric  and  so  define  a  principal  set  of  null 
vectors  of  T£. 

The  same  set  of  null  vectors  could  be  obtained  by  subjecting  the  principal 
coordinates  m«  to  the  transformation 
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in  the  manner  of  §  2.  To  this  transformation  of  coordinates  in  $5  corresponds 
in  $3  the  transformation  'Xa  =  LaXb  (8*17) 

1 


where  [Lg]  =  -^ 


OOf 


0       1    -i     0 

0   -i       1     0 

.*      °      °     l. 

The  covariant  components  of  the  null  vectors  u%  are  then  connected  with 
those  of  the  principal  orthogonal  ennuple  v^  by  the  formula 


(8-18) 


The  matrix  (8-18)  is  non-orthogonal.  It  is  easy  to  show  from  (8-17)  and 

(8-18)  that  4 

S  (Xa)z  =  2i(' X*  'X*  -  'X1  'X*), 

a=l 

so  that  the  form  E(Za)2  of  the  fundamental  quadric  is  not  preserved  when 
in]^  we  take  as  basis  the  set  of  vectors  u\  instead  of  v* ;  which  is  geometrically 
to  be  expected  because  the  tetrahedron  ula  in  $3  is  not  self -polar  with  respect 
to,  but  is  inscribed  in,  the  fundamental  quadric.  It  should  be  noted  that  the 
transformation  (8-16),  unlike  those  of  §2,  does  not  permute  the  Tt7123  or  the 
Tzr456  among  themselves,  but  virtually  interchanges  m*  and  rn5.  It  leaves 
invariant  the  form  S(  ±  )a  (wa)2  of  the  e-quadric,  but  not  the  form  £(ora)2 
of  the  </-quadric. 

It  is  beyond  the  scope  of  the  present  paper  to  study  the  9  sets  of  null 
directions  in  detail.  It  is  enough  to  say  that  each  of  the  corresponding 
9  tetrahedra  in  $3  has  two  pairs  of  opposite  edges  as  generators  forming  a 
skew  quadrilateral  on  the  fundamental  quadric,  and  that,  if  the  whole  set 
of  9  are  taken,  6  generators  of  each  system  are  involved.  These  two  sets  of 
6  generators  correspond  respectively  to  the  points  in  S5  where  the  edges 
23,31,  12  and  56,  64,  45  of  the  common  self-polar  simplex  meet  the  e-quadric. 
All  other  edges  meet  the  e-quadric  in  points  corresponding  to  the  diagonals 
of  the  9  skew  quadrilaterals,  out  of  which  diagonals  are  also  formed  the 
6  self-polar  tetrahedra  already  considered.  The  36  intersections  of  the  6+6 
generators  give  36  points  u*  defining  null  vectors  in  V±  (these  being  separable 
into  9  sets  of  4,  as  above),  the  6  null  vectors  corresponding  to  the  points  on 
any  one  generator  being  mutually  perpendicular  because  the  points  are  all 
conjugate  with  respect  to  the  fundamental  quadric.  As  each  point  is  the 
intersection  of  2  generators,  one  of  each  system,  each  of  the  36  null  vectors 
in  V4  is  perpendicular  to  itself  and  to  10  of  the  others. 

In  the  general  case  when  the  Riemann  tensor  does  not  satisfy  the  Einstein 
condition  there  are  still  15  sets  of  "principal  directions"  defined  by  the 
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Riemann  tensor,  but  they  are  in  general  all  non-null  and  non-orthogonal, 
the  corresponding  points  in  S3  having  then  no  special  relationship  to  the 
fundamental  quadric.  When  they  are  orthogonal  they  coincide  with  the 
Ricci  principal  directions  (Struik,  1927-8,  §4).  It  is  the  self-polarity  of  the 
Riemann  complex  with  respect  to  the  fundamental  tensor  that  separates 
the  15  sets  into  two  sets  of  6  and  9  as  described  above. 

Some  other  properties  of  the  self-polar  Riemann  complex  of  the  first  kind 
are  discussed  in  §§4,5  of  an  earlier  paper  (Ruse,  19446).  Lamson  (1930, 
p.  722)  gives  a  list  of  e-characteristics  which  are  algebraically  impossible 
for  such  a  complex.  They  are  twelve  in  number. 
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1  .    Introduction 

It  has  long  been  known  that  there  is  a  close  connexion  between  the  prime 
numbers  and  the  non-trivial  zeros  of  the  Riemann  zeta-function,*  but 
the  precise  nature  of  this  connexion  has  not  been  elucidated.  However, 
relations  are  known")"  between  certain  sums  involving  the  prime  numbers 
and  other  sums  involving  the  zeros  of  f  (s). 

In  this  paper  it  is  shown  that,  if  the  Riemann  hypothesis  is  true,  then 
there  exists  a  general  summation  formula  connecting  sums  of  the  above 
types.  With  appropriate  conditions  on/(#)  the  main  result  is 


o<miogi><T  P 

=  -(2flr)*lim(     S     9M~fTg(t)lo&l-dt},     (M) 

T->oolo<y<y  *7TjQ  277      J 

(2\i  /•->« 
~)  f(t)cosxtdt, 

p  runs  through  the  prime  numbers,  m  through  the  positive  integers,  and 
iy  through  the  non-trivial  zeros  §  of 


*  Ei.  Landau,  Handbuch  der  Lehre  von  der  Verteilung  der  Primzahlen,  1  (Leipzig, 
1909),  367-368. 

t  E.  Landau,  loc.  cit.  348-368,  and  A.  E.  Ingham,  The  Distribution  of  Prime 
Numbers  (Cambridge,  1932),  chapter  iv. 

f->co  rx 

%  Wo  use  the  notation  I         for   lim    I     . 
J  X-*ttJ 

§  If  multiple  zeros  of  £(«)  exist  then  the  corresponding  terms  g(y)  in  (1-1)  are  to 
be  multiplied  by  the  orders  of  the  zeros.  This  convention  will  be  used  throughout  the 
paper. 
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This  result  resembles  Poisson's  summation  formula,  and  the  method  of 
proof  used  here  was  originally  applied  to  the  latter  formula.  * 

An  expression  is  derived  for  the  remainder  in  the  Riemann-von  Mangoldt 
formula  t  for  N(T),  the  number  of  zeros  of  £(s)  in  0  <  I(s)  <  T. 

Some  examples  and  applications  of  the  summation  formula  (!•!)  are 
given  in  the  last  section. 

The  truth  of  the  Riemann  hypothesis  is  assumed  throughout  the  paper. 

2.   A  pair  of  Hankel  transforms 
Consider  the  integral 


AT  /2  Wsinz 

Now 


i  v  lx\*  r,  *         I2\*d  /sin 

and  hence  -     Jt(xt)  =  -   -     -,--  1  — 

\t/  \TT)  dt\    xt 

Also,  by  definition,  N(t)  =    S    1, 

0<y<< 

using  the  convention  explained  earlier  if  multiple   zeros   of  £(s) 
Hence  the  integral  (2-1)  may  be  written 

/2\*  (•->«(    _    ,      /  t  .       t       t\\d^inxt\, 

-fc)J,   (.^/-(s^sf-s?))*!-^)*- 

If  we  now  write  yr  to  distinguish  the  rth  value  of  y  ,  this  becomes 


occur. 


t        t 


277-  Jo 


ryy^  /y^^  , 
N+I    \  2" 


*  A.  P.  Guinand,  Annals  of  Math.  42  (1941),  591-603. 
f  E.  Landau,  loc.  cit.  chapter  xx. 
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/*—  >-00  An,  /*->00  J/y, 

Now  I        sinw  —  =  £77-,  sin^logiA  —  =  —  \TtG, 

where  C  is  Euler's  constant.  Hence  the  integral  term  in  (2-2)  is  equal  to 


and  (2-2)  becomes 

(-]*  lim  (i  I  sinsy,.     smay^/        y/m       yN+1     y^ 

W    *->«,Wfi       7r  tf/JV+i      \  27T         ^    27T  2/T 


Now,*  as  ^oo      ^(O-log--         -  0(log<).  (2-4) 

Hence,  as  N-+QO 

/         7jv+1        yAr+1 

^ 


and  (2-3)  becomes 


if  this  series  converges.  Now  it  is  known  thatj 


(2-6) 

where  the  dash  indicates  that  the  term  n  =  x,  if  it  occurs,  is  to  be  halved, 
and  p  runs  through  the  non-  trivial  zeros  of  £(s).  The  series  2  is  to  be 
interpreted  as  ,  ^  p 

lim         2        . 

T->  oo  I  -  T  <  /(p)  <  T  j 

Putting  5  =  -J,  a;  =  es,  and  p  =  ^  -f  ty»  tms  becomes 


2'    A(w)%-*-2e*2?-f-2e-*fl?--Jlogcothj2 


y>0      7 


*  E.  C.  Titchmarsh,  The  Z  eta-function  of  Riemann  (Cambridge,  1930),  4.  Referred 
to  in  the  sequel  as  Tract. 

f  E.  C.  Titchmarsh,  Tract,  81. 
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Hence  the  series  in  (2-5)  does  converge,  and  it  follows  that  the  integral 
(2-1)  exists.  Using  the  definition*  of  A(n)  we  obtain 

_  sinyz         1      _,     logp 
— --  - 


fM  v      i  <ntf 

y>0        /  "mlogp^z  P 

+ 1  log  coth  \z  4-  \  arc  tan  er&  —  \C  —  \Tt  —  \  log  STT,        (2-7) 

r'(\\ 
sincet  ^  =  i^7  +  i^  +  \  log  STT. 

Substituting  (2-7)  in  (2-5)  we  obtain 

THEOREM  1 .  //  the  Eiemann  hypothesis  is  true  and  x>Q,  then 

t        t\\dt 


J#)  +  arc  tan  e~*x  —  JTT}. 
If  we  put  f(x)  =  llN(x)-t£l^£*\\,  (2-8) 


+  -7-71  {t  log  (la?  coth  io?)  +  arc  tan  e~**  -  ITT},          .  (2-9) 

X{<t,i7Tj 

then  theorem  1  implies  that  $(#)  is  the  Hankel  transform  J  of  order  f  of  F(x). 
We  can  deduce  the  inverse  transformation  from  theorem  1  by  using  the 
following  lemmas: 

LEMMA  §  a.  //  F(x)  belongs  to  LP(Q,  oo)  for  some  p  in  1  <p  ^  2,  then  there 
exists  a  function  G(x)  belonging  to  Lp'(0,  oo)for  p'  —  pt(p  —  1),  defined  almost 
everywhere  by  ~T 

O(x)  =  l.i.m.       (xt)*J%(xt)  F(t)dt.  (2*10) 

r->oo  JO 
[T 

Further  F(x)  =  l.i.m.       (xt)*Ji(xt)  G(t)dt, 

T-+OO  JO 

*  A(n)  =  logp  when  n  —  pm  and  vanishes  elsewhere. 

t  This  follows  on  putting  s  =  £  in  the  functional  equation  for  £'(s)/£(s).  See 
A.  E.  Ingham,  loc.  cit.  73. 

J  E.  C.  Titchmarsh,  Introduction  to  the  Theory  of  Fourier  Integrals  (Oxford,  1937), 
214-216.  Referred  to  in  the  sequel  as  F.I. 

§  H.  Kober,  Quart.  J.  of  Math.  8  (1937),  186-199,  and  I.  W.  Busbridge,  ibid.  9 
(1938),  148-160. 
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and  iff(x),  g(x)  is  another  pair  of  such  transforms  for  the  same  value  ofp,  then 

("  f(x)  G(x)dx  =  f  * F(x)g(x)dx.  (2-11) 

Jo  Jo 

LEMMA*  /?.  //  F(x)  belongs  to  LZ(Q,  oo)  and  is  of  bounded  variation  in  some 
neighbourhood  of  x  =  y,  and  O(x)  is  defined  by  (2-10),  then 

rt__>OQ 

Jo 

Now  if  F(x)  is  defined  by  (2-8),  then  it  follows  from  (2-4)  that  F(x) 
belongs  to  I>(05oo)  for  any  p>  1,  and  the  conditions  of  lemmas  a  and  ft 
are  fulfilled  for  any  y  >  0.  Hence 


oth  It)  +  arc  tan  e~^  -  £77}  |  rf«. 


—  T 

We  can  deal  with  the  first  part  of  this  integral  as  we  dealt  with  the  integral 
(2-1).  The  second  part  of  the  integral  can  be  evaluated  by  partial  integration 
and  the  use  of  known  integrals.  We  obtain: 

THEOREM  2.  If  the  Riemann  hypothesis  is  true  and  T^O,  then 


r  i  A( 

oLn  =  l 


^_^  _  2^     +       ,       p(i  +  ^  -  T  log  T  +  ^} 


_       _ 

lOgiV 

+  -arctan277--^-arctan(sinh7rT),     (2-12) 

7T  47T 

where  am  F(i  4-  iT7)  t5  defined  by  making  am  F(^)  =  0  and  continuing  analytic- 
ally along  any  path  not  meeting  the  real  axis. 

Other  writers  have  remarkedf  previously  that  a  connexion  exists  between 
N(T)  and  the  first  series  in  (2-12),  but  as  far  as  I  am  aware  no  exact  formula 
of  this  type  for  N(  T)  has  been  given  before. 


*  E.  C.  Titchmarsh,  F.I.  83  and  266. 

t  E.  C.  Titchmarsh,  Tract,  63;  A.  Wintner,  Duke  Math.  J.  10  (1943),  99-105;  and 
A.  Selberg,  Avhandlinger  utgitt  av  detNorsfce  Videnskaps-Akademi  i  Oslo,  no.  1  (1944). 
1-27. 
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Now  it  has  been  shown*  that,  if  the  Riemann  hypothesis  is  true,  then 

as  N->ao  N 

S  A(n)  -  N  =  O(N*  log06  N)  (2-13) 

n  =  l 

holds  for  a  =  2,  and  also  that  the  expression  on  the  left  is 
n  ±  (N*  log  log  log  N). 

These  results  are  the  best  of  their  kind  yet  proved,  so  it  is  still  possible  that 
(2-13)  may  hold  for  some  a  in  0  <  a  <  2. 

If  we  put  s  —  Q  and  8  —  J  successively  in  (2-6),  then  we  can  show  that 

S  A(n)  TI-*  - 

=l 


=  0(1), 
since  £y~2  is  absolutely  convergent. 


N 


Hence  2  A(»)»-»-2tf»  =  0(log«N).  (2-14) 

7t-l 

If  this  holds  for  some  a  <  1,  then  in  theorem  2 


ln=l 

so  this  term  could  be  omitted  from  (2-11). 

3.    The  summation  formula 
We  now  need  the  following  lemmas  :f 

LEMMA  7.  If  f(x)  is  an  integral,  tends  to  zero  as  x  tends  to  infinity,  and 
xf'(x)  belongs  to  LP(O,QO)  for  some  p>l,  then  f(x)  belongs  to  Lp(0,oo)  and 
xltpf(x)  tends  to  zero  as  x  tends  to  +  0  or  to  infinity. 

LEMMA  d.  Iff(x)  satisfies  the  conditions  of  lemma  7  for  some  p  in  1 
then  it  has  a  Fourier  cosine  transform  g(x)  given  for  all  positive  x  by 

/2\*  f-*00 

f(t)cosxtdt. 


*  A.  E.  Ingham,  loc.  cit.  theorems  30  and  34. 
f  A.  P.  Guinand,  loc.  cit.  lemmas  2  and  4. 
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Further  f(x)  =  (-}  f^°°  g(t)cosxtdt, 

g(x)  is  an  integral,  both  g(x)  and  xg'(x)  belong  to  L*>'(0,  oo),  xwg(x)  tends  to 
zero  as  x  tends  to  +  0  or  to  infinity,  and  xf'(x),  xg'(x)  is  a  pair  of  Hankel 
transforms  of  order  f  in  the  sense  of  lemma  a. 

Now  if  f(x)  and  g(x)  satisfy  the  conditions  of  lemma  d  and  F(x)  and  G(x) 
are  defined  by  (2-7)  and  (2-8),  then  by  (2-10) 

(t°xft(x)G(x)dx^  (CGxg'(x)F(x)dx.  (3-1) 

Jo  Jo 

The  left-hand  side  of  (3-1)  is 


-**~j7rafo.    (3-2) 


Now  •-  {-J  log  (  J#  cot  h  \x  )  4-  ar  c  tan  e~*x  —  \TT} 


belongs  to  L^(Q,  oo)  for  any  p  >  1.  Hence  the  integral 

-i*-i7r}^  (3-3) 


converges  absolutely.   Integrating  by  parts,  the  integrated  terms  vanish 
by  lemma  y,  and  (3-3)  becomes 


Scothja;      1  +  e 

-^/."^(ro-i)*5-    (3'4) 

The  other  terms  in  (3-2)  are 


_  4  sinh  Ja; 


ji 

(27T)»  r_>«  IL-7  ^    ^  \0<mtgp<.^  P*™  /JO 

T 


-  f  V)  d(        ^        ^)  +  2  \Tf(x)  cos 

Jo  \0<mloBp<x  P*      I          JO 

10gP 

' 


x-  (3'5) 

SER.  2.     VOL.  50.     NO.  2386.  H 


114  A.  P.  GUINAND  [15  Nov. 

since  the  last  integral  converges  absolutely.  Adding  (3*4)  and  (3-5)  we  find 
that  (3-2)  is  equal  to 


-  r 

Jo 


1         f°°  If00/     f&%  1\ 

-To-xi          /(*)«'**  ^+«7^-U          /(^)hV--- 

(27r)*J0  2(27T)*J0        '\smha;     a;/ 

The  sum  of  the  last  two  integrals  is  equal  to 


Now  consider  the  right-hand  side  of  (3-1).  It  is  equal  to 


O<7<!T 

since  the  integrated  terms  vanish  by  (2-4)  and  lemma  8.  Hence  we  have, 
altogether: 

THEOREM  3.  //  (i)  the  Riemann  hypothesis  is  true, 

(ii)  f(x)  is  an  integral,  tends  to  zero  as  x  tends  to  infinity,  and  xf'(x)  belongs 
to  L*>(0,  oo)  for  some  pin  1  <p  ^  2,  and 


g(x)  =  /^)  cos  xtdt, 


then 


lim  I        S         -^/(^log^)-  [*  f(x 

T->c«  lO<mlogi)<r  ^*  Jo 


-(2fr)*liin       S    ^(y)-(*)logdte.     (3-6) 
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We  can  simplify  this  result  slightly  by  making  further  assumptions. 
By  lemma  7  and  (2-14) 


J(T)( 

\Q< 


<T  P 

=  o(T*-w*»).  (3-7) 

Hence: 

THEOREM  3'.  //,  in  addition  to  the  assumptions  of  theorem  3,  either  (2-13) 
holds  for  CL  —  l/p  or  T2f(T)  tends  to  zero  as  T  tends  to  infinity,  then  the  term 
(3-7)  can  be  omitted  from  (3-6). 

Further,  if  it  is  desired,  the  left-hand  side  of  (3-6)  can  be  expressed  in 
terms  of  A(n).  Putting  x  =  log  t,  U  =  eT  we  have 


lim  f         S         l^ij£f(™  log JP)  -  f  Tf(x)  e**  dx] 

T->oo  VO<mlogD<T  P  JO  j 


4.    Examples  and  applications 

Numerous  examples  of  functions  to  which  the  formula  (3-6)  is  applicable 
can  easily  be  found,  corresponding  to  the  many  known  applications  of 
Poisson's  summation  formula.*  Two  examples  chosen  for  their  special 
interest  are  given  here. 

(A)    Suppose^  J  <  JR(s)  <  1,  and 


)  satisfies  the  conditions  of  theorem  3  for  1  <p<  l/JR(s),  andf2(x) 
satisfies  them  for  p  =  2.  Hence  the  result  (3-6)  holds  for  both  functions, 
and  by  addition  it  also  holds  for  the  function 


(2\i  /*->oo 
—  I    I        t~-*cosxtdt 


*  E.  C.  Titchmarsh,  F.I.  64-65. 
t  Cf.  A.  P.  Guinand,  loc.  cit.  597. 

H2 


116  A.  P.  GUINAND  [15  Nov. 

Also  if  we  write  for  the  infinite  integral  in  (3-6) 

i  r°°      /i  •   *> 
/(*)=i      x~s(L_4 
v  '      2J0        \x     s 


then  this  defines  an  analytic  function  of  s  for  0  <  E(s)  <  I,  and  in  this  strip 
If1       /I       e~*x  \          If00    __l_8          1  r«>x-8e-*x 

1  /*i       /i       e~§a;  \  11  C^x~se~^x  , 

"~  2 Jo        \a;     sinha;/  25     2ji    sinh# 

This  defines  the  analytic  continuation  of  I(s)  for  j5(s)  <  0,  where  it  is 
^~"2J0  2s     2j0    sinhrr 

J  o  ex  —  e~x 

CD       f  CO 
__    V1       I 

nt'ojo 


Now,  if  we  write 


n=0 

then          £  (4n+l)-»  =  4^(«)  +  i  2  (4^4-l)-8  +  i  2  (4w  +  3)-"-l 

n=l  -^w  =  0  ^w=0 


Hence  /(*)  =  F(l 

If  (2-13)  holds  for  some  a  <  R(s),  then  (3-6)  gives 


(/p«  rp          Jfs  \ 

2    y'"1-^—  log^-  +  ;r—i 
Q<y<T  2™  27T       27rS) 
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(B)   Put  #>0  and 

(cosyx  —  cosyX     (Q^x^X), 

/(*)  =  to  (.>*). 

The  conditions  of  theorem  3'  are  satisfied,  and 

g(x)  =  (  -  1  ~—^  —  gr  (x  sin  yX  cos  xX-y  cos  2/Jf  sin  xX) 

\iTj  x(y  —x) 

if  x  4=  y  .  If  #  =  y  then  this  becomes 


Substituting  this  in  (3-6)  we  obtain 

-^r  {cos  (^m  los^)  ~  cos  yx}  ~    (cos  ^  ~"  cos  yx)  ^X  dx 

P*  Jo 


0<mlogp<X  P* 


=  -  2  s  -/—  2~  —  2\  (^  sin  2/^-  cos  yx~y  cos 

r>o7(2/  ~7  ) 

+  1  f  °°        ^__^  (^  sin  ?/JC  cos  x  X  -  y  cos  yX  sin  #  JC)  log  ^-  dx,     (4-2) 
TrJo  ^2-a;2)v         J  *K 


where  the  term  y  —  y,  if  it  occurs,  is  to  be  interpreted  as  in  (4-1). 
Now  let  X  tend  to  infinity.  For  fixed  y 


Also 


cos  yx-y  cos  ^  si 


-71         y>o,  y=H/7!2/-/l 
=  0(1), 
since  the  latter  series  converges.*  The  term  y  -  y,  if  it  occurs,  gives  a  term 


*  A.  E.  Ingham,  loc.  cit.  theorem  256. 
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Further 

1      /•  00  a  i  % 

-       —7-ir  —  5;  (#  sin  y%  cos  xX-y  c°s  yX  sin  xX)  log  —  dx 
njo  x(y*  —  x£)  *n 


1  f-sinajJK:,       3,1     f  *  sin  (a?  -  y)  X  ,       a? 

—  y  cos  v-X       -7  -  —,  log  ^r-  ^  +  -  y\     ~-~  —  ^  —  log  — 

Tr^  JO    »(y  +  »)          2^T  TT^Jo         (^2-2/2)  &27T 

1  ^f^sinicX,       a;   ,       1  vr«>sinxX. 

-  cos  yX       -  log  —  dx  ---  cos  yX      --  log  - 

^  Jo      y  +  X        627T  7T  ^      J0  X  & 


«>sinxX.       x   , 
dx 


By  Fourier's  single  integral  theorem*  the  first  of  these  integrals  tends 
to  zero  as  X  tends  to  infinity,  and  the  last  tends  to 


There  remains  in  (4-3)  • 

1  vC«>BmxX.       x   ,  1  __f°°sin$.      /    t    \ 

—  cosyJC       —  —  log^r-aa?  =  —  cos^/A       —  —  log  \^—^\ 

TT  Jo  X  &27T  7T  Jo        t          *  \27TA  / 


Hence  (4-2)  becomes 

}— 

Jo 


0<mlogp<X 


Hence  we  have: 

THEOREM  4.  //  the  Riemann  hypothesis  is  true,  then  as  X  tends  to  infinity 


-cos  T/^          2  _ 

p<XJP* 
(y-y), 


*  E.  C.  Titchmarsh,  J?1./.  theorem  12. 
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Inversely,  if  we  interchange  f(x)  and  g(x)  we  can  prove: 

THEOREM  5.  If  the  JKiemann  hypothesis  is  true,  then  as  X  tends  to  infinity 


O(logX) 
However,  this  result  is  a  simple  consequence  of  the  known  result*  for  x  >  1 


New  College 
Oxford 


*  E.  C.  Titchmarsh,  Tract,  theorem  42. 
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ON  ANALYTIC  FUNCTIONS  POSSESSING  CERTAIN 
PROPERTIES  OF  UNIVALENCY 

By  ZEEV  NEHARI 
[Received  2  September  1945 — Read  15  November  1945] 

1.  In  the  present  paper,  various  classes  of  analytic  functions  possessing 
certain  properties  of  univalency  are  investigated.  Throughout  the  paper, 
ample  use  is  made  of  the  principle  of  subordination  (principle  of  the  majorant 
function)*  which  appears  to  be  particularly  suited  to  the  treatment  of  the 
problems  under  discussion.  In  all  cases,  **  best  possible  "  results  are  obtained 
— a  characteristic  feature  of  the  majorant  method  when  appropriately  used. 

The  functions  discussed  will  usually  be  of  the  form 

w=f(z)  =  z  +  a2z2+...,  (a) 

and  will  be  assumed  to  be  regular  in  the  unit  circle  |  z  \  <  1 .  In  order  to 
submit  the  properties  of  such  functions  to  a  closer  study,  the  very  great 
generality  still  allowed  by  the  class  of  functions  (a)  is  usually  qualified  by 
various  additional  assumptions  which  lead  to  more  specialized  classes  of 
functions  /(z).  One  of  the  moat  fruitful  of  these  additional  assumptions  is 
the  condition  that  the  domain  D  on  which  w  =  f(z)  maps  the  unit  circle 
should  be  "schlicht",  i.e.  should  not  overlap  itself.  Functions  f(z)  satisfying 
this  condition  form  the  class  S  of  univalent  or  "schlicht"  functions  which 
has  been  thoroughly  studied  and  for  which  a  great  many  important  results 
have  been  obtained.  Most  noteworthy  among  these  results  is  the  classical 
distortion  theorem  according  to  which  any  point  w  =  d  outside  D  satisfies 
the  inequality  |  d  \  ^  J. 

Now  this  condition  of  D  being  "schlicht"  can  be  generalized  in  many 
ways.  Some  of  these  generalizations  will  be  investigated  in  the  following, 
and  it  will  be  shown  that  in  spite  of  considerably  weaker  assumptions 
results  of  the  type  of  the  distortion  theorem  still  hold.  As  may  be  expected, 

*  For  a  detailed  account  of  the  principle  of  subordination  and  its  many  applications 
see  Littlewood,  Lectures  on  the  Theory  of  Functions  (Oxford,  1944). 
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the  constant  J  will  have  to  be  replaced  in  these  cases  by  suitable  smaller 
constants. 

2.  In  the  first  theorem  to  be  proved,  the  condition  that  D  be  "schlicht" 
will  be  replaced  by  the  much  weaker  assumption  that  there  exists  a  point 
d  on  the  boundary  of  D  such  that  w  =  d  may  be  connected  with  w  =  0  by 
a  "schlicht"  curve,  i.e.  a  curve  covered  by  D  exactly  once.  Stating  the 
latter  condition  in  a  different  way,  we  may  say  that  for  all  points  w'  on  this 
curve  the  equation  f(z)  —  w1  shall  have  exactly  one  solution  satisfying 

\Z\<1. 

THEOREM  I.    Let 

w  =f(z)  =  z  +  oaza+... 

be  regular  in  the  unit  circle  and  let  D  denote  the  domain  on  which  I  z  I  <  1  is 
mapped  by  w  =/(z);  let,  further,  w  =  d  be  a  point  on  the  boundary  of  D  which 
may  be  connected  with  w  =  0  by  a  "schlicht"  curve.  Then 


This  inequality  is  the  best  possible. 

Before  we  begin  the  proof  of  this  theorem,  we  discuss  the  special  case  in 
which  the  curve  in  question  is  the  stretch  0  <  w  <  d  (d  may  be  assumed  to 
be  positive  without  loss  of  generality).  If  we  consider  the  function 


we  see  without  difficulty  that  w  =  g(z)  is  subordinate  to  the  function 
w  =  fj(z)  =  az+  ...  which  maps  |  z  \  <  1  on  a  Riemann  surface  D  of  the 
following  description:  D  covers  the  whole  finite  w-plane  an  infinity  of  times, 
with  the  exception  of  the  stretch  0  ^  w  <  1  which  is  covered  only  once  and 
the  points  w  —  1  and  w  =  oo  which  are  outside  D  (i.e.  D  has  no  boundary 
points  not  satisfying  either  O^w^l  or  w  =  oo);  furthermore,  D  is  every- 
where "locally  schlicht",  i.e.  has  no  inner  branch-points.  Here  we  should 
show  that  D  is  simply-connected,  but  this  may  be  dispensed  with,  as  we 
shall  presently  construct  the  function  w  =  $(z)  which  maps  |  z  \  <  1  on  D. 

Since  both  w  =  g(z)  and  w  =  %(z)  vanish  for  z  =  0  and  the  conformal 
representations  of  the  unit  circle  by  both  functions  cover  the  stretch 
0  ^  w  <  1  in  a  "schlicht"  manner,  the  function 


is  regular  —  and  [  w(z)  \  ^  1  —  for  those  values  z  which  correspond  to  0  ^  g(z)  <  1  . 
Since  no  other  value  taken  by  w  =  g(z)  (for  \z\<  1  )  coincides  with  a  boundary 
point  of  D  (i.e.  0^w<  1,  w  =  oo)  or  a  branch-point  (of  which  D  has  none), 
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we  may  conclude  that  w(z)  is  uniform  and  |  o)(z)  |  ^  |  z  |  for  |  z  \  <  I  or,  in 
other  words,  that  g(z)  is  subordinate  to  $(z).  The  developments  of  g(z) 
and  $(z)  being  g(z)  =  zfd  +  ...  and  $(z)  —  az+  ...  respectively,  this  entails 


We  now  proceed  to  construct  the  function  w  =  $(z)  which  maps  |  z  \  <  1 
on  this  surface  D.  To  this  end  we  consider  the  conformal  mapping  properties 
of  the  function  w  =  sin  x.  This  function  maps  the  half-strip 


on  the  half-plane  ${w}  >  0,  the  points  x  =  \n>  —  JTT,  oo  and  w  —  1,  —  1,  oo 
corresponding  respectively.  As  both  the  half-strip  and  the  half-plane  are 
bounded  by  straight  lines,  the  function  w  —  sin  x  can  be  analytically 
continued  with  the  help  of  Schwarz's  principle  of  symmetry.  By  inverting 
the  half-strip  continually  along  the  rays  $t{x}  =  —  -|TT,  0  <  ${x]  <  oo  and 
$t{x}  —  £TT,  0  <  ${#}  <  °°  and  the  rays  corresponding  to  them  in  the  half- 
strips  so  obtained,  the  whole  half-plane  ^{x}  >  0  will  eventually  be  covered 
(in  a  "schlicht"  manner).  In  the  w;-plane,  the  corresponding  inversions 
will  be  made  with  regard  to  the  rays  —  oo  <  w  <  —  1  and  1  <  w  <  oo.  The 
function  w  =  sin  x  will  therefore  map  the  half-plane  ${#}  >  0  on  the  whole 
finite  w-plane,  covered  an  infinity  of  times,  with  the  exception  of  the 
stretch  —  1  ^  w^  1  which  is  not  covered  at  all.  All  boundary  points  of  this 
surface  satisfy  either  —l^w^l  or  w  =  oo.  We  now  use  the  symmetry 
principle  once  more,  this  time  with  regard  to  the  stretch  —  JTT  <  x  <  \n.  In 
the  it-plane  we  thus  obtain  the  entire  plane  slit  along  the  rays  —  oo  <  x  <  —\TT 
and  \TI  <  x  <  oo  ;  in  the  w-plane  the  result  is  a  surface  D'  very  nearly  related 
to  D  (D'  leaves  out  the  points  w  =  ±  1  ,  oo  and  covers  simply  the  stretch 
—  1  <w<  I),  Since  the  conformal  representation  of  |  z  \  <  1  on  an  #-plane, 
cut  in  the  way  just  described,  is  effected  by  the  function 


it  follows  that  w  —  (&(z)  —  sin    -  —  ^    maps  the  unit  circle  on  D'  .  Now  the 


relation  between  (&(z)  and  the  function  g(z)  mapping   |z|<l  on  D  is 
],  this  transformation  obviously  linking  D  and  D'.  Hence 


Combined  with  (1),  this  finally  yields 

,       1 
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So  far  the  simply  covered  curve  has  been  the  stretch  0  <  w  <  d.  If  we  now 
turn  to  the  general  case,  we  start  by  considering  the  function 


which  is  also  regular  for  \z\<  l,/(z)  not  vanishing  there  except  for  z  =  0. 
w  —  g(z)  does  not  take  the  values  w  =  ±  1  and  both  w  —  1  and  w  =  —  1  can 
be  connected  with  w  =  0  by  curves,  say  Cj  and  c2,  which  are  simply  covered 
by  the  surface  on  which  w  =  y(z)  maps  |  z  \  <  1.  w  =  g(z)  being  an  odd  func- 
tion of  z,  cx  and  c2  are  symmetrical  with  regard  to  w  =  0.  Let  now  E  be  the 
surface  arising  from  the  surface  D'  considered  earlier  by  deforming  the 
stretch  —  1  <  w  <  1  so  as  to  take  the  shape  of  the  combined  curves  c±  and  c2, 
i.e.  E  is  locally  "schlicht",  leaves  out  the  points  w  =  ±  1,  oo,  covers  the 
curves  ct  and  c2  only  once,  and  all  finite  boundary  points  of  E  lie  on  Cj 
and  ca.  Since  E  is  obtained  from  D'  by  a  continuous  deformation  of  the 
w  plane,  E  is  also  simply-connected  and  can  therefore  be  obtained  as  a 
conformal  representation  of  the  unit  circle  by  a  function  w  —  ($(z)  =  0z  +  .  .  .  . 
Considerations  similar  to  those  applied  earlier  to  the  domain  D'  show  that 

w  =  g(z)  =  -7=  -f-  ...  is  subordinate  to  w  =  Qb(z)  —  /3z+  ...,  whence  we  obtain 


In  order  to  find  an  upper  bound  for  \fi  \  ,  we  consider  the  function 

2  2/? 

u  —  h(z)  —  —  arcsin<$(2)  =  —  z+  ..., 

7T  7T 

which  is  regular  and  uniform  for  |  z  \  <  I  ,  since  the  critical  points  of 
u  =  arc  sin  w,  viz.  w  =  ±  1,  oo,  are  left  out  by  w  =  (&(z).  The  simplycovered 
curves  cx  and  c2  are  transformed  by  the  main  branch  of  h(z)  into  two  other 
simply  covered  curves  c{  and  c'2  connecting  u  —  1  and  u  —  —  1  with  u  =  0. 
Since,  for  values  w'  on  the  curves  cx  and  c2,  the  equation  w'  —  f(z)  had  only 
one  solution  z  satisfying  |  z  \  <  1,  the  analytical  continuation  of  u  =  h(z) 
for  all  values  |  z  \  <  I  leaves  out  all  points  ur  +  2n  (n  —  ±  1,  ±2,  ...),  where 
u'  is  any  point  of  c(  or  c'z.  We  may  therefore  conclude  that  the  points  u  —  ±  I 
belong  to  the  "outer  boundary"  of  the  domain  F  on  which  u  =  h(z)  maps 
the  unit  circle,  i.e.  u  —  ±  1  may  be  connected  with  u  —  oo  by  curves  p± 
and  p2  respectively,  not  belonging  to  F.  u  —  h(z)  being  an  odd  function 
of  z,  p±  and  p2  may  be  chosen  symmetrical  with  regard  to  u  —  0.  u  —  h(z) 
is  therefore  subordinate  to  the  odd  "schlicht"  function  u  =  k(z)  —  yz+  ... 
mapping  |  z  \  <  I  on  the  entire  z-plane  slit  along  p^  and  p2.  Now,  by  the 
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distortion  theorem,  odd  "schlicht"  functions  u  —  k(z)  =  yz+  ...  not  taking 
(for  |  z  |  <  1)  the  values  u  =  +  1  satisfy  |  7  |  ^2.  The  function 

u  =  h(z)  =  2/?z/7T+... 
being  subordinate  to  u  =  k(z),  this  entails 


7T 

Combined  with  (2),  this  finally  yields 

1          1 


It  is  clear  that  the  case  d  =  1/Tr2  can  only  occur  for  one  of  the  functions 


where  |  K  \  =  1.  This  completes  the  proof  of  theorem  I. 

3.  The  hypothesis  of  the  next  theorem  will  be  half-way  between  that  of 
theorem  I  and  the  assumption  that  the  function  considered  is  "schlicht". 

THEOREM  II.  Let  w  =/(z)  =  z  +  aaz2+  ...  be  regular  for  \  z  \  <  1,  and  let  D 
denote  the  domain  on  which  w  ~  f(z)  maps  the  unit  circle;  let  further  w  =  d 
be  a  point  of  the  boundary  of  D.  If  the  interior  of  the  circle  \w\  —  \d\ia  covered 
by  D  in  a  "schlicht"  manner,  i.e.  if  for  \  w'  \  <  |  d  \  the  equation  w'  —  f(z)  has 
exactly  one  solution  z  satisfying  \z\<l,we  have 


This  inequality  is  the  best  possible. 

The  proof  of  this  theorem  is  a  straightforward  majorant  affair.  If  (without 
loss  of  generality)  d  is  assumed  positive,  it  is  clear  that  the  function 


is  subordinate  to  the  function  w  =  %(z)  =  dz+  ...  mapping  the  unit  circle 
on  a  surface  D  defined  as  follows:  D  is  locally  "schlicht"  and  covers  the 
finite  w-plane  an  infinity  of  times,  with  the  exception  of  the  circle  |  w  \  <  1 
which  is  covered  only  once  and  the  points  w  =  1  and  w  =  oo  which  are  not 
covered  at  all.  All  boundary  points  of  D  satisfy  either  \w\  —  1  or  w  =  oo. 
D  is  simply-connected,  as  will  be  shown  by  actually  finding  the  function 
w  =  $(z)  =  az  -h  .  ..  mapping  |  z  \  <  I  on  D. 

w  =  g(z)  =  z/d+  ...  being  subordinate  to  w  =  2r(z)  =  <xz+  ...,  we  have 

(3) 


1945]  ON  UNIVALENCY  OF  ANALYTIC  FUNCTIONS  125 

Now  it  is  easily  seen  that  %(z)  is  of  the  form 


Indeed,  for  |  2  |  =  1  (and  24=  —  1)  we  have  |  gf(z)  \  —  1,  and  3r(z)/z  does  not 
vanish  for  |  z  \  <  1.  Obviously  w  =  %(z)  takes  all  values  |  w  \  <  I  only  once 
for  |  z  |  <  1.  The  derivative  of  Qr(z)  is 

.I^)(^)2.  (4) 

As  this  expression  does  not  vanish  except  for  2=1,  it  follows  that 
z  exp  1  2  .  -  -  1  maps  |  z  \  <  1  on  a  locally  "  schlicht  "  surface,  say  D.  Further- 

more, (4)  shows  that  $'(z)  has  a  double  zero  at  z  —  1.  At  this  point  the 
curve  into  which  w  ~  £J(z)  transforms  |  2  |  =  1  turns  by  an  angle  of  27r. 
Because  of  $(1)  =  1  this  means  that  the  point  w  —  1  is  not  covered  by  the 
particular  sheet  of  the  surface  D  in  which  He  the  points  w  =  %(0),  where 
6  is  positive  and  slightly  smaller  than  1.  If  w  =  1  were  covered  by  another 
sheet  of  D,  the  same  would  be  true  of  the  points  w  =  ?$(0).  However,  all 
points  |  w  |  <  1  are  taken  by  w  —  $(z)  only  once  and  therefore  we  have 
F(z)*l  for  |z|<l. 

The  function       w  —  $(z)  —  z  exp  1  2  .  -—  j  =  e2z  -f  .  .  . 
mapping  |  z  \  <  1  on  the  surface  D  described  above,  we  have,  by  (3), 


Here,  equality  is  only  possible  for  functions  of  the  form  ~^^^(Kz)  with 


This  completes  the  proof. 

4.  Theorems  I  and  II  both  occupy  intermediate  positions  between  the 
classical  distortion  theorem  on  the  one  hand,  and  Hurwitz's  theorem  on 
functions  "schlicht  in  one  point"  on  the  other.  If  written  in  the  order  of 
increasing  generality,  these  four  theorems  may  be  enunciated  as  follows  : 

Let  w  =  f(z)  be  an  analytic  function,  regular  for  \  z  \  <  1  ,  and  let  w  =  d  be 
a  point  on  the  boundary  of  the  domain  D  on  which  the  unit  circle  is  mapped 
byw=f(z).  Then: 

(a)  if  the  whole  domain  D  is  "schlicht",  we  have 
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(6)  if  the  circle  \w\<\d\  is  covered  by  D  in  a  "schlicht"  manner,  we  have 

!'!>?; 

(c)  if  a  curve  connecting  w  =  d  with  w  =  0  is  covered  by  D  in  a  "schlicht" 
manner,  we  have  1 


(d)  if  the  point  d  is  covered  by  D  in  a  "schlicht"  manner,  we  have 

\*\*#- 

In  the  last  case  it  has  to  be  further  stipulated  thatf(z)  J=dfor  \z\<l. 

5.  The  object  of  this  section  will  be  a  generalization  of  theorem  I.  In 
addition  to  the  hypotheses  of  theorem  I  we  shall  further  assume  that  the 
function  w  =  f(z)  —  z+  ...  is  bounded  by  a  positive  number  M  (  >  1),  i.e. 

\f(z)\*M,     \z\<\. 

It  is  clear  that  this  additional  condition  will  result  in  an  increased  value 
of  the  lower  bound  for  the  distance  of  the  boundary  point  w  =  d  from  the 
origin. 

Following  the  usual  procedure  in  the  case  of  bounded  functions,  we  shall 
divide  the  function  f(z)  by  M  and,  in  the  sequel,  deal  with  functions 
w  =  f(z)  =  az+  ...  satisfying  |  f(z)  |  <  1  for  |  z  \  <  1.  The  theorem  to  be 
proved  may  then  be  enunciated  as  follows  : 

THEOREM  III.  Let  w  —  f(z)  =  az-\-  ...  be  an  analytic  function,  regular  for 
|  z  |  <  1  and  satisfying  there  \  f(z)  |  <  1;  let  further  d  (0  <  \  d  \  <  1)  be  a  point 
of  the  boundary  of  the  domain  D  on  which  |  z  |  <  1  is  mapped  by  w  =  f(z),  such 
that  w  =  d  may  be  connected  with  w  =  0  by  a  "schlicht"  curve.  Then  we  have 


(5) 
where  r  (r  >  0)  is  given  by  the  equation 


&2and'd'3beingtheJacobian'd'-functions.  This  bound  for  \  a  \  is  the  best  possible. 

This  result  can  also  be  interpreted  in  the  opposite  way,  namely,  if  r  is 
determined  by  assuming  equality  in  (5),  then  the  right-hand  side  of  (6) 
gives  a  lower  bound  for  |  d  \  . 

Before  we  enter  upon  the  proof  of  theorem  III,  we  remark  that  for  I/a  f  oo 
theorem  I  is  obtained  as  a  limiting  case  of  theorem  III.  In  fact,  if  we  denote 
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by  (e)  quantities  that  tend  to  0  for  T  f  oo,  the  asymptotic  expressions  for 
the  ^--functions  are 


and    «j(0;  r)  =  16«-*H  +  (e)]. 

Assuming  equality  in  (5),  the  case  1/afoo  corresponds  therefore  to  rfoo, 
and  in  view  of 


~ 


rr2 


I  d  I  4e~4r  1 

we  obtain  |  d'  |  =  J_j  >  j^-^-j^  +  (e)  =  -t  +  (e), 

which  is  equivalent  to  theorem  I. 

The  proof  of  theorem  III  resembles  very  closely  the  proof  of  theorem  I. 
We  shall  first  assume  the  "schlicht"  curve  to  be  the  stretch  Q^w<d, 
where  d  is  again  assumed  positive  without  loss  of  generality.  The  role  of 
the  surface  D  will  here  be  played  by  a  surface  E  obtained  from  D  by  con- 
tracting it  in  the  ratio  1  :  d  and  then  cutting  away  the  outer  parts  of  all  its 
sheets  along  the  circle  \w\  =  1.  The  (locally  "schlicht")  surface  E  may 
consequently  be  defined  as  follows  :  E  covers  the  circle  |  w  \  <  1  an  infinity 
of  times,  apart  from  the  stretch  O^w  <d  which  is  covered  by  E  exactly 
once  and  the  point  w  --=  d  which  is  not  covered  at  all.  E  is  simply  -connected 
as  is  clear  from  the  manner  by  which  it  was  obtained  from  the  simply- 
connected  surface  D.  Obviously,  the  function  w  =  f(z)  =  az+  ...  is  sub- 
ordinate to  the  function  w  =  $(z)  —  flz  +  ...  mapping  |  z  \  <  1  on  E,  whence 
we  obtain  |  a  \  ^  |  ft  \  . 

Our  next  aim  is,  therefore,  to  find  an  analytical  expression  for  the  function 
w  =  ?$(z)  or,  at  least,  for  its  first  coefficient  /?.  For  this  purpose  we  consider 
the  conformal  mapping  properties  of  the  Jacobian  elliptic  function 
w  =  sn  x  =  x  +  .  .  .  satisfying  the  differential  equation  w'2  =  (  1  —  w2)  (  I  —  k2w2). 
If  4:K  and  2iK'  (K,Kr  real)  denote  the  elementary  periods  of  w  =  &nx, 
this  function  maps  the  rectangle  [_  —  K;  K\  K  +  iK'\  —  K  +  iK']  in  the 
re-plane  on  the  half  -plane  ^{w}  >  0,  the  points 


_ 

w  -I 


K  +  iK' 


0 


0 


iK' 


corresponding.  The  expression  of  k  in  terms  of  the  period  ratio  r  —  K'/K  is 
of  the  form 

^(Ojjr) 

«S(0;r)' 

where  #«  and  &»  are  the  Jacobian  #-f unctions. 
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In  order  to  continue  w  =  sn  x  analytically  for  the  whole  strip  0  <  ${x]  <  Kr, 
we  again  use  Schwarz's  symmetry  principle.  Since  to  the  inversions  of  the 
mentioned  rectangle  with  respect  to  the  stretches  $l{x}  =  K,  0  <  ${#}  <  K' 
and  $t{x}  =  —K,  0  <  ${x}  <  K'  correspond  inversions  of  the  upper  half- 
plane  ${w}  >  0  with  respect  to  the  stretches  1  <  w  <  k~l  and  —  k~l  <  w  <  —  1 , 
it  follows  that  w  =  sn#  maps  the  strip  0  <  ${x}  <  K'  on  a  locally  "  schlicht " 
surface  covering  the  finite  w-plane  an  infinity  of  times,  with  the  exception 
of  the  stretch  —l^w^l  and  the  rays  —ao^w^  —  k"1  and  fc"1  ^w^co 
which  are  not  covered  at  all.  Another  application  of  Schwarz's  symmetry 
principle,  this  time  with  respect  to  the  stretch  —K<x<K,  shows  that 
w  =  sn  x  maps  the  strip  —  K'  <  ${x}  <  K' ',  cut  along  the  rays  —  oo  ^  x  ^  —  K 
and  K <#<oo  on  the  locally  "schlicht"  surface  Ef  defined  as  follows:  All 
boundary  points  of  W  satisfy  one  of  the  inequalities 


the  rays    —  oo^w^  —k~l  and  fc"1  <  w  <  oo  and  the  points  w  =  ±  1   are 
outside  E,  and  the  stretch  —  1  <  w  <  1  is  covered  by  one  sheet  of  E'  only. 
Now  this  surface  E'  has  many  traits  in  common  with  the  surface  we 

2u 
attempt  to  construct.  In  fact,  since  the  function  w  =  j-~ ^  maps  |  u  \  <  1 

on  the  entire  w?-plane  cut  along  the  rays  —  oo  <  w  <  —  &""1  and  k~l  <  w  <  oo, 
we  may  infer  that  the  function  u  —  p(x)  =  \kx  +  . . .  defined  by 

o^/^.\ 

--=snz  (7) 


maps  the  strip  —  K'  <  ${#}  <  K',  cut  along  the  rays  —co^x^  —K  and 
K^x^ao,  on  a  surface  E"  connected  with  E  by  a  root  transformation, 
i.e.  if  u  =  (&(z)  maps  |  z  \  <  1  on  E"  and  u  =  $(z)  maps  |  z  \  <  1  on  E,  the 
relation 


holds.  Indeed,  in  view  of  (7)  the  surface  E"  on  which  u  =  p(x)  maps  the 
strip  in  question  covers  |  u  \  <  I  an  infinity  of  times,  apart  from  a  stretch 
—  #  <  u  <  d  (8  to  be  defined  presently)  which  is  covered  by  E"  exactly  once 
and  the  points  u  =  ±  S  which  are  not  covered  at  all.  A  glance  at  (8)  shows 
that  3  =  <Jd,  where  w  —  d  was  the  point  left  out  by  w  =  g(z)  (|  z  \  <  1).  In 
view  of  (7),  the  modulus  k  of  the  elliptic  function  is  therefore  determined  by 


»§(0;r) 
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Since  the  modulus  of  r  is  connected  with  the  modulus  belonging  to  2r  by 
the  relation  2<J[k(2r)]  _  ..  . 

l+k(2T)  ~    (T)' 
this  may  also  be  written          *Jd  = 


'-*<*>-$?$• 

In  order  to  complete  the  construction  of  the  function  $(2),  we  have  now 
to  find  a  function,  say  x  =  r(z),  mapping  z  \  <  I  on  the  strip  —K'<  £${x}  <  K ', 
cut  along  the  rays  —  oo^x^  —K  and  K^x^oo',  $(z)  will  then  be  of  the 
form  gr(z)  =  pz[r(*Jz)].  But  this  function  can  be  built  up  by  elementary 
conformal  representations.  The  function 


maps  |  £  |  <  1  on  the  strip  —  K1  <  ${x}  <K'.  If  we  cut  this  strip  along  the 
rays  —  oo^cc^  —  K  and  K^x^co,  the  corresponding  cuts  in  the  £-plane 
will  be  —  1  <;  £  ^  —  9  and  0  ^  £  ^  1 ,  0  being  defined  by 

2K'        l  +  0_  K 

77  1  —  6 

i.e.  6  =  tanh  ^     [T  =  K' ]K\. 

The  last  step  will  now  be  to  find  the  function  £  =  h(z]  mapping  |  z  \  <  1  on 
the  circle  |  £  |  <  1  cut  along  the  stretches  —  1  ^  £  ^  —0  and  0  ^  £  ^  1 .  But  it 
is  easily  seen  that  this  conformal  representation  is  effected  by  the  function 


where  g(z)  =   —         and     A  = 


The  first  coefficient  of  the  power  series  of  h(z)  —  which  is  all  the  information 
we  need  for  our  purpose  —  is 

.          W  2  tanh  JTT/T         ,      ,  ,     . 

A  =  -  —  ^9  —   —  -  —  ,017"  —  tanhiTT/r. 
1  +  0*      l+tanh2j7r/r  2   ; 

The  development  of  the  function  r(z)  will  therefore  start  with 

.  .       2K' 
r(z)  =  _lo 

being  of  the  form  p2{r(*Jz)},  we  obtain  finally 
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In  view  of  fc  =  and     K>  =      *i(0;  T)> 


the  first  coefficient  of  $(z)  may  also  be  written 
P  =  72#*(0;  r)tanh2  JTT/T. 

This  completes  the  proof  in  the  case  in  which  the  "schlicht"  curve  con- 
necting w  —  0  with  w  =  d  was  the  stretch  0  ^  w  <  d.  The  extension  of  this 
result  to  the  case  of  a  general  "schlicht"  curve  connecting  the  points  w  =  0 
and  w  =  d  can  be  achieved  in  very  much  the  same  way  as  in  the  proof  of 
theorem  I.  The  place  of  the  function  2/7T  sin"1  (w)  used  there  will  be  taken 
here  by  the  inverse  of  the  elliptic  function  we  utilized  for  the  proof  of  the 
particular  case  of  theorem  III.  Instead  of  the  usual  distortion  theorem 
which  was  there  used  to  clinch  the  argument,  recourse  has  to  be  made  here 
to  a  well-known  generalization  of  the  distortion  theorem  which  asserts 
that  the  first  coefficient  a  of  an  odd  "schlicht"  function  W~OLZ+..., 
bounded  by  unity  and  leaving  out  the  points  w  =  ±  I  (l>0),  attains  its 
largest  modulus  for  the  particular  function  mapping  |  z  \  <  1  on  the  full 
circle  |  w  \  <  1  cut  radially  from  w  =  I  to  w  =  1  and  from  w  —  —ltow  =  —  1  . 

5.    In  this  section  we  shall  make  use  of  the  conformal  representation 

effected  by  the  function  w  =  sin  {  -  -  5  [  discussed  earlier,  in  order  to  prove 

(I  -{-z  j 

a  theorem  on  doubly  -connected  domains.  The  main  argument  used  here 
will  in  many  respects  be  similar  to  the  principle  of  subordination  used  in 
the  case  of  simply-connected  domains. 

A  function  w  ~  f(z)  mapping  an  annulus  1  <  |  z  \  <  m  on  a  doubly-connected 
domain  D  may  be  said  to  be  subordinate  to  a  function  w  =  $(z)  mapping 
an  annulus  1  <  |  z  \  <  M  on  a  doubly-connected  domain  D,  if  the  relation 


holds,  where  o)(z)  is  regular  for  1  <  |  z  \  <  m,  satisfies 


and  the  argument  of  a)(z)  does  not  return  to  its  initial  value  if  z  describes 
any  circle  |  z  \  =  £  (1  <p  <  m). 

In  case  D  does  not  overlap  itself,  i.e.  is  a  "schlicht"  domain,  this  defini- 
tion of  subordination  is  equivalent  to  saying  that,  for  1  <  \  z  \  <  m,  f(z)  takes 
only  values  inside  D  (and  that,  inside  Z>,  the  inner  or  outer  boundary  of  D 
cannot  shrink  continuously  into  evanescence). 

The  applicability  of  this  definition  of  subordination  lies  in  the  well- 
known  fact  that  in  the  case  of  both  domains  D  and  D  being  "schlicht", 


1945]  ON  UNIVAXENCY  OF  ANALYTIC  FUNCTIONS  131 

the  modulus  m  of  D  cannot  be  greater  than  the  modulus  M  of  D,  the  modulus 
of  a  doubly-connected  "schlicht"  domain  being  a  steadily  increasing 
function  of  the  domain. 

However,  this  property  of  the  modulus  steadily  increasing  with  the  domain 
breaks  down  if  the  domains  D  and  D  are  not  both  "schlicht"  or,  which  is 
somewhat  less,  if  oj(z)  is  not  a  "schlicht"  function  of  z  for  1  <  |  z  \  <  m. 

Nevertheless,  the  inequality  m^M  holds  also  in  the  general  case,  as  is 
shown  by  the  following  lemma  due  to  M.  Schiffer: 

LEMMA.*  Let  D  be  a  doubly  connected  domain  contained  in  the  annulus 
1  <  |  w  |  <  M.  If  the  argument  of  a  point  w  does  not  return  to  its  initial  value 
when  this  point  describes  once  the  inner  boundary  of  D  (or  the  outer 
boundary,  for  that  matter),  the  modulus  m  ofD  satisfies  the  inequality 


Here  the  sign  of  equality  can  only  hold  if  D  is  identical  with  the  annulus 
l<\w\<M. 

In  order  to  prove  the  lemma,  we  denote  by  w  =  f(z)  the  function  mapping 
1  <  |  z  |  <  m  on  D  and  consider  the  function 

g(z)  =  *-«/(*),  (9) 

where  2?m  is  the  total  increase  of  the  argument  off(z)  when  z  once  describes 
any  circle  \z\  =  r,  l<r<m.  Here  the  integer  n  has  been  assumed  to  be 
positive,  as  otherwise  we  could  have  considered  the  function  M/f(z)  instead. 
Obviously,  n  must  be  finite  since  otherwise  D  would  be  simply-connected. 

From  (9)  it  follows  that  the  argument  of  g(z)  returns  to  its  initial  value 
when  z  once  describes  any  circle  \z\  =  r,  I  <r  <m.  Now,  D  being  contained 
in  the  annulus  1  <  |  w  \  <  M,  we  have,  for  |  z  \  =  1, 


«!/(*)  |^1, 

and  for  |  z  |  =  m,  |  g(z)  \  =  m~n  \  f(z)  \  ^  Mm~n. 

(Here/(z)  is  supposed  to  be  regular  on  both  boundaries  of  the  annulus;  if 
this  is  not  the  case,  an  annulus  1  -\-e<  \z\  <m  —  e  with  sufficiently  small  e 
has  to  be  considered  instead.)  We  now  suppose  for  a  moment  that  M  <m. 
This  would  require  that,  for  |  z  |  =  1,  we  should  have  |  g(z)  \  ^  1  and  for 
|  z  |  =  m  we  should  have  |  g(z)  \  <  1,  i.e.  the  two  boundaries  of  the  domain 
D'  on  which  w  =  g(z)  maps  1  <  |  z  \  <  m  would  be  separated  from  each  other 
by  a  circle  |  w  \  —  1  —  $  (3  sufficiently  small)  which  is  fully  covered  by  D' 
and  on  which  lie  no  boundary  points  of  D'  .  But  this  contradicts  the  fact 
that  the  argument  of  g(z)  returns  to  its  initial  value  after  one  full  circuit 

*  I  owe  this  lemma  and  its  proof  to  a  conversation  which  I  had  with  Dr  M.  Schiffer. 

12 
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of  z;  indeed,  g~1(w)  would  be  defined  for  all  points  of  the  circle  |  w  \  =  1  —  8, 
and  after  w  describing  this  circle  a  sufficient  number  of  times,  say  n'  times, 
g~1(w)  would  return  to  its  initial  value,  i.e.  there  would  exist  a  closed  curve 
inside  the  annulus  1  <  |  z  \  <  m,  such  that  after  z  once  describing  this  curve 
the  argument  of  g(z)  would  increase  by  27ra',  which  is  impossible. 

The  case  m  =  M  can  only  occur  for  f(z)^Kz,  |  jK"|  =  1.  In  fact,  for 
|  z  |  =  m  we  had  |  g(z)  \  ^  Mm~n\  since  |  g(z)  |  <  1  led  to  a  contradiction,  we 
must  have  n  =  1.  Moreover,  it  is  clear  that  the  only  case  for  which  |  g(z)  \  ^  1 
will  not  lead  to  a  contradiction  is  that  in  which  the  domain  D'  on  which 
w  =  g(z)  maps  1  <  |  z  \  <  m  reduces  to  a  single  point,  i.e.  w  =  g(z)  is  a  constant. 
This  completes  the  proof  of  the  lemma. 

As  an  application  of  the  lemma  we  shall  prove  here  the  following  theorem: 

THEOREM  IV.  Let  D  be  a  doubly  -connected  —  not  necessarily  "schlicht"  — 
domain  of  the  modulus  m;  let  P  be  a  point  of  the  outer  boundary  of  D  and  Q 
a  point  of  its  inner  boundary,  such  that  P  may  be  connected  with  Q  by  a 
"  schlicht"  curve]  let  further  R  be  another  point  of  the  inner  boundary  of  D 
such  that  P,  Q,  R  lie  on  a  straight  line  and  that  Q  and  R  may  be  connected  by 
a  curve  not  covered  by  D\  if  we  denote  the  distances  PQ  and  PR  by  d  and  a 
respectively,  we  have 


where  %(x)  is  the  elliptic  modular  function.  This  inequality  is  the  best  possible. 
Remark.  The  distances  a  and  d  being  generally  more  easily  accessible 
than  the  modulus  m,  it  would  perhaps  be  more  appropriate  to  read  this 
result  in  reversed  order,  so  as  to  obtain  a  bound  for  the  modulus  m  with 
the  help  of  a  and  d  which  are  presumed  as  known.  The  function 


increases  steadily  with  m  ;  we  have  accordingly 


where  the  function  q(x)  is  determined  by 

x  -  ctg 


As  to  the  proof,  we  shall  —  as  in  the  cases  of  theorems  I  and  III  —  proceed 
by  stages,  first  proving  the  theorem  for  a  special  case  and  then  showing 
that  the  result  obtained  holds  also  in  the  general  case.  This  special  case  will 
be  that  in  which  the  curve  connecting  P  and  Q  is  the  stretch  PQ,  and  the 
stretch  QR  is  outside  D.  Without  loss  of  generality  we  may  assume  R  to 
coincide  with  w  =  0  and  both  P  and  Q  to  lie  on  the  positive  axis.  We  may 
further  assume  that  R  coincides  with  w  =  1  (this  involving  a  trivial  trans- 
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formation).  This  domain  D  leaves  out  the  stretch  0  <  w  <  c  (c  =  a  /(a  +  d)  in 
our  previous  notation)  and  covers  simply  the  stretch  c<w<l.  w  —  c 
belongs  to  the  inner  boundary  of  D  and  w  =  1  to  its  outer  boundary. 

We  are  now  going  to  construct  a  surface  D  majorant  (in  the  extended 
sense  defined  above)  to  D.  To  this  end  we  recall  the  properties  of  the  surface, 

used  in  the  proof  of  theorem  I,  on  which  the  function  w  —  sin2  j™1-—  }  maps 

the  circle  \z\  <\.  This  surface  covers  the  ^-plane  an  infinity  of  times, 
apart  from  the  stretch  0  ^  w  <  1  which  is  covered  exactly  once  and  the 
points  w  =  1  and  w  —  oo  which  are  not  covered  at  all.  If  we  now  cut  this 
surface  along  the  stretch  0  <  w  <  c,  we  obtain  a  doubly-connected  surface, 
say  Z>,  which  will  have  the  required  majorant  properties. 

For,  let  w  —  f(z)  now  be  the  analytic  function  mapping  the  annulus 
1  <  \  z  \  <  m  on  D  and  w  =  $(z)  the  function  mapping  1  <  \  z  \  <  M  on  D. 
If  we  then  consider  the  function  a>(z)  defined  by 


it  is  clear  that  a>(z)  is  regular  and  uniform  for  all  values  I<\z\<m,w  =  f(z) 
taking  the  values  c  <  w  <  1  only  once  and  leaving  out  the  values  0  <  w  ^  c, 
w  =  1  and  w  =  oo.  Moreover,  the  stretch  0  <  w  <  c  connects  two  points 
of  the  inner  boundary  of  D  and  is  not  itself  covered  by  D  ;  it  follows  that 
the  total  change  of  argument  of  /(z)  —  y,  where  ?/  is  a  point  of  Q<?j<c 
and  z  completes  a  full  circuit  of  the  inner  boundary  of  D,  cannot  reduce 
to  zero.  We  have  therefore,  in  virtue  of  the  lemma, 


m  being  the  modulus  of  D  and  M  that  of  D. 

We  now  find  an  analytical  expression  for  $(z),  or  rather  for  the  function 
3^(2)  =  $(mz)  which  is  more  convenient  to  handle.  As  in  similar  previous 
cases,  it  will  be  simpler  to  deal  with  the  symmetrical  function  $  (z)  defined  by 


This  function  maps  the  annulus  M  ~*  <  \  z  \  <  1  on  a  surface  covering  the 
w-plane  an  infinity  of  times,  apart  from  the  stretches  —  1  <  w<  -~*Jc  and 
^c<w<l  which  are  simply  covered  and  the  stretch  —^c<w<^c  and  the 
points  w  =  +  1,  oo  which  are  not  covered  at  all.  Now  this  surface  is  obtained 
from  the  surface  on  which 


maps  |  z  \  <  I  by  applying  a  cut  along  the  stretch  —*Jc<w<*jc.  w  =  sin  I  —  —  -  \ 

{  1  -f  z  } 

being  real  for  real  2,  this  corresponds  to  a  cut  in  the  z-plane  from  z  —  —  I  to 
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nl 


z  —  I,  where  I  is  defined  by  ^/c  =  sin  *         2 
therefore  be  of  the  form 


The  function  w  —  (&(z)  will 


where  u  =  p(z)  is  the  function  mapping  the  annulus  p  <  \  z  \  <  1  (p 
on  the  interior  of  the  circle  |  u  \  <  1,  cut  along  the  stretch  —l<u<l.  Our 
task  is  therefore  reduced  to  finding  an  expression  for  the  function  u  =  p(z). 
Although  u  =  p(z)  can  be  readily  expressed  in  terms  of  Jacobian  elliptic 
functions  by  making  use  of  their  known  conformal  mapping  properties, 
it  is  perhaps  more  in  the  spirit  of  the  theory  of  conformal  representation  to 
construct  the  analytical  expression  of  u  =  p(z)  direct  from  its  mapping 
properties. 

u  —  p(z)  maps  the  annulus  p  <  |  z  \  <  1  on  the  "schlicht"  circle  |  u  \  <  1 
cut  along  the  stretch  —l<u<l.  Both  domains  being  symmetrical  with 
regard  to  the  real  axis  and  the  origin,  we  may  assume  p(z)  to  be  real  for  real 
z  and  have  then  p(p)^l,  p(—p)  =  —l,  p(l)  =  l,  jp(  —  1)  =  —  1.  Since 
u  =  p(z)  transforms  |  z  \  =  1  into  |  u  \  =  1  ,  we  have,  by  Schwarz's  symmetry 
principle,  p(z~  ~l)p(z)  =  1  or  (in  view  of  p(z)  =  p(z)) 

p(z)p(z-i)  =  1.  (10) 

As  u  =  p(z)  further  transforms  the  circle  |  z  \  =  p  into  parts  of  the  real  axis 
we  have,  again  by  the  symmetry  principle,  p(p2z~l)  =  p(z),  which,  with 

p(*)-W),  gives  P(Z)  =  p(p*z-i).  (11) 

By  suitably  combining  (10)  and  (11)  we  obtain  the  functional  equation 

p(z)  =  p(p*z),  (12) 

which  makes  it  clear  that  p(es)  is  an  elliptic  function  possessing  the  funda- 
mental periods  2ni  and  41og/o. 

From  the  symmetry  of  the  function  u  =  p(z)  it  follows  that  p(z)  vanishes 
for  z  =  ±  ip,  and  consequently,  in  view  of  (12),  also  at  all  points  z  =  ±  ip*n+1 
(n  —  ±1,  ±2,  ...).  The  mapping  being  conformal  at  these  points,  all  these 
zeros  are  simple.  As  is  easily  seen,  there  are  no  other  zeros  of  u  —  p(z). 
In  virtue  of  (10),  all  poles  of  u  —  p(z)  are  situated  at  the  points  z  =  ±  ip^n~1 
(n  —  Oj  ±  1,  ±  2,  ...),  all  of  them  being  simple.  We  then  consider  the  infinite 
product 


ilog-;^, 


(13) 
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which  converges  for  all  values  of  z  other  than  0,  oo  and  the  obvious  poles. 
The  zeros  and  poles  of  (f)(z)  are  identical  with  those  of  p(z).  Moreover,  we 

have  obviously  ,,  ,  .        ,,. 

y  <j>(p*z)  =  0(z), 

so  that  (j)(z)  (or  rather  <j>(ez))  is  an  elliptic  function  possessing  the  same  zeros 
and  poles  as  p(z).  By  Liouville's  theorem,  <j>(z)  and  p(z)  are  therefore 
identical,  apart  from  a  multiplicative  constant  which,  however,  as  shown 
by  (10),  can  only  have  the  values  ±  1.  Hence  we  have 

u  =  p(z)  =  <j)(z)  =  sn{ilogz/p;  p4}, 

sn  being  the  Jacobian  elliptic  function.  What  we  are  mainly  interested  in  is 
the  length  of  the  stretch  —l<u<l.  Remembering  that  p(p)  =  I,  we  obtain, 
in  view  of  (13), 


where  %  (x)  denotes  the  elliptic  modular  function. 

Now  I  was  connected  with  the  quantity  c  (c  —  a/(a  +  d))  by  the  equation 


satisfying  a  functional  equation  which  —  in  terms  of  Z  —  may  be  written 


we  are  led  to  c  =  sin2  {%nl2(^/p)}  =  sin2{j7rV[£(/o2)]}. 

Remembering  that  c  =  a/(a  +  d)  and  p  =  M~*,  we  obtain  finally 


so  that  ^  =  actg2{i7rV[X(lf-1])}.  (14) 


Since  the  right-hand  side  of  (14)  increases  steadily  with  M,  we  have  —  in 

viewofm^M  —  ,  „ 

d  ^  a  ctg2  { 


This  completes  the  proof  for  the  case  in  which  both  the  simply-covered 
curve  connecting  P  with  Q  and  the  non-covered  curve  connecting  Q  with 
R  were  linear  segments. 

As  for  the  case  of  general  curves,  this  can  again  be  reduced  to  the  special 
case  discussed  by  considering  the  function 
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Exactly  as  in  the  proof  of  theorem  I  it  can  be  shown  that  the  surface  H  on 
which  x  —  q(z)  maps  the  annulus  1  <  |  z  \  <  m  leaves  out  a  curve  connecting 
x  —  0  with  both  points  x  —  ±ain~1^Jc  (this  curve  being  inside  the  inner 
boundary  of  H)  and  two  curves  connecting  both  x  =  1  and  x  =  —  1  with 
x  —  co  (these  curves  being  outside  the  outer  boundary  of  H).  x  =  q(z) 
being  an  odd  function  of  z,  the  two  latter  curves,  on  the  one  hand,  and  the 
two  parts  of  the  first  curve  connecting  x  =  0  with  the  points  x  =  ±  siii"1  ^c 
on  the  other,  may  be  assumed  symmetrical  with  regard  to  x  =  0.  By  the 
lemma  the  modulus  of  H  cannot  be  greater  than  that  of  the  "schlicht" 
domain  H',  symmetrical  with  regard  to  x  =  0,  which  covers  the  entire 
aj-plane  apart  from  these  curves.  By  a  theorem  due  to  Groetzsch,*  the 
modulus  of  H'  is  smaller  or  equal  to  that  of  the  special  "schlicht"  domain 
which  consists  of  the  whole  #-plane  cut  along  the  Rtretch 


and  the  rays  —co^x^  —  1  and  1  ^  x  ^  oo.  But  this  special  domain  leads  us 
exactly  to  our  function  which  solved  the  extremum  problem  in  the  special 
case  discussed.  Consequently,  $(z)  gives  also  the  solution  in  the  general 
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ON  THE  FOUR-COLOUR  CONJECTURE 

By  W.  T.  TUTTE 
[Received  27  November  1945  —  Read  13  December  1945] 

1.    Introduction 

The  maps  discussed  in  this  paper  are  dissections  of  surfaces  into  simple 
polygons,  called  regions.  In  each  map  it  is  supposed  that  the  regions  are 
finite  in  number  and  that  each  vertex  of  a  region  is  common  to  just  three 
regions.  Sides  and  vertices  of  the  regions  will  be  called  edges  and  vertices 
of  the  map  respectively. 

A  colouring  of  a  map  M  is  denned  as  a  set  of  four  mutually  exclusive 
subclasses,  called  colour  -classes,  of  the  regions  of  M  such  that  each  region 
belongs  to  some  colour-class  and  110  two  regions  of  the  same  colour-class 
have  any  edge  in  common.  If  Z  is  a  colouring  whose  colour-classes  are 

Ct,  O2,  Oo  and  (7d,  we  write 
1     2     3  "  Z  =  (C^C^C^C,). 

The  union  A  u  B  of  two  colour-classes  A  and  B  of  a  colouring  Z  we  call 
a  colour  -dyad.  The  regions  of  a  colour-dyad  may  or  may  not  form  a  connected 
set.  In  any  case  we  call  the  disjoint  internally  connected  components 
Kempe  chains  and  denote  their  number  by  cQ(A  u  B). 

If  U  is  a  Kempe  chain  of  A  u  B  and  V  is  the  remainder  of  A  u  J5,  then  it 
is  clear  that  the  four  sets 

((AnU)u(BnV)),     ((BnU)v(AnV)),     C,     D, 

where  Z  =  (A,  B,  G,D)  and  Ac\U,  for  example,  denotes  the  intersection 
of  A  and  £7,  constitute  a  colouring  Z±  of  the  map  concerned  and  that  Z± 

differs  from  Z  if  and  only  if 

c0(AvB)>l. 

We  say  that  Zl  is  derived  from  Z  by  an  exchange  operation  on  the  Kempe 
chain  U.  The  set  of  all  colourings  of  the  map  that  can  be  derived  from  Z  by 
a  finite  sequence  of  exchange  operations  we  call  the  colour-system  containing 
Z  and  denote  it  by  Yl(Z).  Clearly,  if  y  is  any  colouring  in  H(Z),  then 


The  problem  with  which  this  paper  is  concerned  is  as  follows. 
PROBLEM.  Let  M  be  a  map  on  the  sphere,  and  let  M  contain  a  pentagon  P. 
Let  any  colouring  Z  of  that  part  MP  of  M  which  is  exterior  to  P  be  said 
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to  be  of  type  I  if  it  has  a  colour-class  A  which  contains  no  region  of  MP 
adjacent  to  P,  and  of  type  II  otherwise. 

The  problem  is  to  find  MP  and  Z  such  that  all  the  colourings  of  Tl(Z)  are 
of  type  II. 

A  demonstration  that  this  problem  is  insoluble  would  complete  the 
verification  of  the  four-colour  conjecture  by  enabling  us  to  deduce  from  the 
existence  of  a  colouring  of  MP  the  existence  of  a  colouring  of  type  I  of  MP 
and  thence  (by  assigning  P  to  A)  the  existence  of  a  colouring  of  M.~\ 

The  contribution  of  this  paper  to  the  problem  is  the  deduction  of  some 
new  limitations  on  the  structure  of  II  (Z)  which  must  be  satisfied  in  any 
solution  that  may  exist. 

Use  is  made  of  an  elementary  general  theorem  on  the  colourings  of 
spherical  maps,  to  which  I  have  not  seen  any  reference  in  the  literature. 
This  is  the  "Parity  Theorem"  proved  below. 

Reference  may  be  made  to  a  paper  by  KittelJ  on  the  above  problem. 
In  comparing  this  paper  with  his  it  should  be  borne  in  mind  that  he  counts 
as  distinct  colourings  which  differ  only  by  a  redistribution  of  "colours" 
among  the  colour-classes  —  a  distinction  which  has  no  meaning  with  the 
definitions  used  here. 

2.    The  parity  theorem 

Let  Zi  —  (A,B,  C,D)  be  a  colouring  of  the  spherical  map  M  ofoc2  regions. 
Let  a2(X)  and  oc2(X  \j  Y)  denote  the  number  of  regions  in  the  colour-class 
X  and  the  colour-dyad  (X  u  Y)  respectively,  and  let  fi-^X  u  Y)  be  the 
number  of  edges  in  which  regions  of  X  meet  regions  of  Y.  Then,  if  c1(JC  u  Y) 
is  the  connectivity  of  the  set  of  regions  X  u  Y,  we  have  by  elementary 
topology  §  CO(A  u  B)~c^(A  vB)  =  a2(A  u  B)-^(A  u  B)  (I) 

and  Ci(AuB)  =  cQ(CvD)-I.  (2) 

From  these  equations  and  the  corresponding  ones  for  the  colour-dyads 
A  v  C  and  A  u  D  it  follows  that 
c»(A  u  B)  +  cQ(A  vC)  +  c0(A  vD)-c0(CuD)-  c»(B  vD)-c0(Bu  C) 


3,  (3) 

ReA 

where  /(  R)  denotes  the  number  of  sides  of  the  polygon  R. 


f  Heawood,  Quart.  J.  of  Math.  24  (1890),  332-338. 

|  Kittel,  Bull.  American  Math.  Soc.  41  (1935),  407-413. 

§  See,  for  example,  Newman,  Topology  of  Plane  Sets  (C.U.P.  1939),  194-199. 
The  regions  must  be  dissected  into  triangles  before  his  results  are  applied,  but  this 
introduces  no  real  difficulty. 
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Now  the  right-hand  side  of  (3)  depends  only  on  the  colour-class  A.  Hence 
we  have 

THEOREM  I.  The  quantity  on  the  left  of  equation  (3)  has  the  same  value  for 
all  colourings  of  M  for  which  A  is  a  colour-class. 

We  say  that  two  colourings  Z±  and  Z%  of  a  map  are  connected  if  there 
exists  a  finite  sequence  of  colourings  of  the  map  beginning  with  Z±  and 
ending  with  £/2  and  such  that  each  pair  of  consecutive  members  have 
a  colour-class  in  common.  Clearly  any  two  colourings  which  belong 
to  the  same  colour-system  are  connected.  (But  the  converse  does  not 
follow.) 

For  any  colouring  Z,  let  J(Z)  denote  the  sum  of  the  quantities  cQ(X  \j  Y) 
over  the  six  colour-dyads.  We  call  the  parity  of  J(Z)  the  parity  of  Z. 

THEOREM  II  (the  parity  theorem).  //  Zl  and  Zz  are  connected  colourings 
of  a  spherical  map  M,  then 

J(Z1)~J(Z2))     (mod  2).  (4) 

For  by  (3),  for  any  colouring  of  M 

J(Z>)=  E/(^)  +  #2+1»     (mod  2), 

KeA 

where  A  is  any  colour-class  of  Z. 

Hence  (4)  is  true  whenever  Z^  and  Z2  have  a  colour-class  in  common, 
and  therefore  it  is  true  whenever  Z^  and  £/2  are  connected. 

It  may  be  worth  mentioning  that  the  colourings  of  a  particular  spherical 
map  need  not  be  restricted  to  one  parity.  For  example  the  two  colourings 
shown  in  Fig.  1  have  opposite  parities. 


Fig.  1 
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3.    The  pentagon  problem 

Let  us  now  return  to  the  problem  of  the  partially  coloured  map  stated  in 
the  introduction. 

Let  F  denote  the  set  of  regions  of  M  which  meet  P.  We  denote  these 
regions  by  Flt  F2,  F3,  F4,  F5  in  the  cyclic  order  of  incidence  with  the  edges 
of  P.  We  may  suppose  that  they  are  taken  in  clockwise  order  as  seen  from 
the  centre  of  the  sphere.  We  have  not  yet  assumed  that  they  are  all 
distinct. 

In  a  colouring  of  type  II  each  colour-class  contains  a  region  of  F.  Three 
colour-classes  therefore  each  meet  P  in  one  edge  only,  and  the  fourth  meets 
it  in  just  two  edges,  E±  and  E2  say.  E1  and  E%  do  not  meet,  for  if  they  did  the 
third  edge  incident  with  their  common  vertex  would  be  common  to  two 
regions  of  the  fourth  colour-class.  There  is  therefore  an  edge  E%  of  the 
pentagon  adjacent  to  both  E^  and  E2.  We  call  the  set  of  regions  of  F  which 
have  E1  or  E2  as  an  edge  the  norm  of  the  colouring,  and  the  region  of  F 
having  E3  as  an  edge  the  apex  of  the  colouring. 

The  following  well-known  result  will  be  needed. f 

THEOREM  II T.  If  Z  is  a  colouring  of  MP  such  that  all  the  members  of  H(Z) 
are  of  type  //,  then  if  any  Kempe  chain  contains  the  norm  of  Z  it  contains  also 
the  apex  of  Z. 

For  let  Z  =  (A,B,C,  D)  where  A  contains  the  norm  and  B  the  apex  of  Z. 
Then  if  the  theorem  is  false  for  Z  one  of  the  remaining  two  regions  of  F, 
e  C  say,  must  be  separated  in  MP  from  the  apex  by  a  Kempe  chain  of  A  u  D 
containing  the  norm.  Hence,  by  an  exchange  operation  on  that  Kempe 
chain  of  B  u  C  which  contains  the  apex,  a  type  I  colouring  of  il(Z)  can  be 
obtained,  contrary  to  hypothesis. 

It  follows  from  theorem  III  that  if  all  the  colourings  of  II (Z)  are  of 
type  II,  then  the  five  regions  Ft  are  all  distinct.  We  therefore  assume  their 
distinctness  in  what  follows. 

If  Fi  is  the  apex  of  Z,  then  the  norm  of  Z  is  the  pair  of  regions  Fi+lt  Fi+4:. 
(The  addition  in  the  suffices  is  modulo  5.)  We  then  denote  the  Kempe  chain 
containing  FM  and  Fi+2  by  g(Z)  and  the  colour-dyad  to  which  it  belongs 
by  O(Z).  We  denote  the  Kempe  chain  of  G(Z)  which  contains  the  other 
member  Fi+4!  of  the  norm  by  h(Z).  It  follows  from  theorem  III  that  g(Z) 
and  h(Z)  are  distinct. 

We  define  a  ^-operation  on  Z  as  the  application  of  the  exchange  operation 
to  each  member  of  a  subset  A  of  the  Kempe  chains  of  G(Z),  where  A  contains 
g(Z)  but  not  h(Z). 

f  Heawood,  loc.  cit» 
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(Note.  A  similar  operation  on  G(Z)  affecting  h(Z)  but  not  g(Z)  would 
be  equivalent  to  the  A-operation  affecting  just  those  components  of  G(Z) 
which  the  former  operation  left  unchanged.) 

We  define  a  /(.-circuit  in  II (Z)  as  a  cyclic  sequence  of  colourings  belonging 
to  IT  (Z)  such  that  each  member  of  the  sequence  can  be  derived  from  its 
immediate  predecessor  by  a  single  A-operation.  A-operations  and  A-circuits 
can  of  course  be  defined  in  the  same  way  when  11  (Z)  contains  type  I 
colourings  but  they  are  of  particular  interest  in  the  other  case,  for  then  it 
has  been  shown  that  every  colouring  in  II  (Z)  is  a  member  of  a  A-circuit,f 

If  a  A-operation  is  applied  to  a  colouring  Z  of  apex  Fi  the  norm  of  the  new 
colouring  will  be  the  pair  Fi+2>  ^+4,  and  its  apex  will  therefore  be  FM. 
The  colour-classes  of  the  old  and  new  apices  must  be  common  to  both 
colourings,  since  neither  belongs  to  G(Z).  Each  A-operation  thus  advances 
the  apex  three  places  in  the  cyclic  sequence  of  the  F^  Hence  if  the  number 
of  members  of  any  A-circuit  is  n,  then 

»==0,     (mod  5).  (5) 

The  object  of  this  paper  is  to  improve  upon  this  result  by  establishing 
the  following  two  theorems: 


sO,     (mod  10). 


THEOREM  IV. 
THEOREM  V. 


4.    Proof  of  theorem  IV 

Let  XZ  be  the  colouring  obtained  from  Z  by  application  of  the  A-operation 
A.  The  intersections  of  the  colour-classes  with  F  will  be  as  shown  in  the 
diagrams  (i)  and  (ii)  of  Fig.  2  for  Z  and  XZ  respectively.  The  colour-classes 
B  and  D  are  common  to  both  colourings. 


C' 


Fig.  2 


|  Errera,  "Du  Coloriage  des  Cartes  et  de  Quelques  Questions  d'Analysis  Situs1 
Thesis  (Gauthier-Villars,  1921).  See  also  Kittel,  loc.  cit. 
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Since  the  regions  of  F  are  distinct,  a  new  spherical  map  N  can  be  formed 
from  M  by  incorporating  P  in  that  region  of  F  which  belongs  to  B  in  the 
colourings  Z  and  XZ.  If  the  new  region  is  assigned  to  B,  the  colourings  Z 
and  XZ  will  be  transformed  into  colourings  ZN  and  (XZ)N  respectively 
of  N.  These  two  colourings  of  N  are  connected,  for  they  have  the  colour-class 
D  in  common.  Hence  by  the  parity  theorem 

J(ZN)  =  J((XZ)N)t     (mod  2).  (G) 

Now  in  the  change  from  Z  to  ZN  the  only  two  of  the  six  quantities  C0(  X  u  Y) 
not  obviously  unaltered  are  c0(B  u  (7)  and  c0(B  u  D).  But  for  one  of  these, 
let  us  say  c0(B  u  (7),  to  be  altered  it  is  necessary  for  the  two  regions  in  which 
the  corresponding  colour-dyad  of  Z  meets  F  to  be  in  different  Kempe  chains 
of  that  colour-dyad.  This  colour-dyad  (B  u  C)  of  Z  would  then  contain  the 
apex  and  would  not  separate  the  members  of  the  norm  in  MP.  The  members 
of  the  norm  would  therefore  belong  to  the  same  component  of  (A  u  D)  in  Z, 
which  contradicts  theorem  III,  since  the  apex  of  Z  is  in  B  and  therefore 
not  in  (A  u  D).  Hence 

J(ZN)  =  J(Z).  (1) 

Again  in  the  change  from  XZ  to  (XZ)N  the  only  quantities  c0(X  u  Y)  not 
obviously  unaffected  are  cQ(B  u  D)  and  c0(B  u  A').  But  the  first  of  these  is 
unchanged,  by  the  last  paragraph,  since  B  u  D  is  the  same  in  Z  as  in  XZ 
and  in  ZN  as  in  (XZ)N.  However,  by  theorem  III  the  two  members  of  the 
norm  in  XZ  are  in  different  Kempe  chains  of  (  B  u  A')  (which  does  not  contain 
the  apex).  cQ(B  uA')  therefore  decreases  by  1  and  so 


1.  (8) 

From  (6),  (7)  and  (8)  we  have 

J(\Z)  =  J(Z)+lt     (mod  2).  (9) 

It  follows  that  the  number  of  members  of  any  A-circuit  must  be  even. 
Hence  by  (5),  na0>  (modlo)  (10) 

5.    Proof  of  theorem  V 

Assume  that  a  A-circuit  of  10  members  exists.  Let  its  members  be  in 
order  Z0,  Zlt  Za,  ...,  ZQ. 

Let  At  denote  that  colour-class  of  Zi  which  contains  the  apex.  By  the 
paragraph  immediately  preceding  equation  (5),  A^^  Ai  and  Ai+1  are  all 
colour-classes  of  Zt.  Moreover  they  are  distinct,  for  the  apices  of  ^_15  Zt 
and  Zi+1  are  distinct  regions  Fp  ^+3  and  Fj+l.  (Addition  in  the  suffices  of 
the  At  and  the  Z^  is  mod  10.) 
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The  ten  colourings  Zi  are  therefore  completely  determined  when  the  ten 
colour-classes  Ai  are  given,  for  when  three  colour-classes  of  a  colouring  are 
given  the  fourth  is  uniquely  determined. 

If  R  is  any  region  of  MP  we  write 

vt  =  Vi(R)  =  1     if    R  is  in  Ait 

=  0     if    R  is  noting.  (11) 

Then  the  set  of  10-vectors 


given  for  all  R  completely  determines  the  Ai  and  therefore  the  Z^. 

As  an  example  we  note  that  Fj  is  in  Ai  if  and  only  if  it  is  the  apex  of  Z^ 
and  that  this  happens  just  once  in  any  five  consecutive  members  of  the 
A-circuit.  So  we  may  write 

V(FL)  =  (1000010000),  (12) 

a  vector  which  we  denote  hereafter  by  e. 

We  then  have,  by  the  properties  of  A-operations, 


and  in  general,  V(F1+r)  =  Q2re,  (13) 

where  Q  is  a  cyclic  permutation  defined  by 

Q(VO)VI}  ...,v9)  =  (vQ9v0,vl9  ...,v8).  (14) 

We  denote  the  group  of  cyclic  permutations  Q*  by  £X  and  say  that  two 
10-vectors  are  equivalent  when  they  can  be  transformed  into  one  another 
by  operations  of  £}. 

A  10-vector  whose  components  are  restricted  to  the  values  0  and  1  we 
call  a  V-vector. 

We  say  that  a  F-  vector  V  is  admissible  if  it  satisfies  the  following  con- 
ditions: (i)  no  three  consecutive  signs  of  V  (regarded  as  a  cyclic  sequence) 
include  more  than  one  1  ;  and  (ii)  there  exist  three  other  F-vectors  satisfying 
(i)  such  that  their  sum  with  F  is  /  =  (1  1  1  1  1  1  1  1  1  1). 

The  other  three  F-vectors  of  (ii)  are  clearly  admissible.  We  call  a  set  of 
four  admissible  F-vectors  whose  sum  is  /  a  tetrad. 

We  have  at  once 

LEMMA  I.  //  V±  and  V2  are  equivalent  V  -vectors  and  Vl  is  admissible,  then 
V2  is  admissible. 

For  the  properties  (i)  and  (ii)  are  invariant  under  the  transformation  Q. 

LEMMA  II.  //  R  is  any  region  of  MP,  then  V(R)  is  an  admissible  V-vector. 
For  first,  by  (11),  V(R)  is  a  F-vector. 
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Secondly,  suppose  two  of  vt_i(R),  v^R),  and  vi+1(R)  have  the  value  1. 
Then,  by  (11),  R  belongs  to  two  of  the  colour-classes  Af_l9  At,  Ai+l  which  is 
impossible  since  these  are  distinct  colour-classes  of  the  same  colouring  Z€. 
Hence  V(R)  satisfies  (i). 

Thirdly,  any  region  RQ  of  MP  has  some  vertex  not  a  vertex  of  P.  For  if 
this  were  false  for  F}  then  J^_j  and  FJ+1  would  not  be  distinct.  At  this  vertex 
R0  meets  two  other  regions,  R±  and  Rz  say,  of  MP.  Consider  the  matrix 
whose  three  rows  are  the  vectors  V(R0),  V(Rl),  V(R2). 

No  column  of  this  matrix  contains  more  than  one  1,  since  no  two  of 
R0)  R±  and  R2  belong  to  the  same  colour-class  in  any  colouring  of  MP.  Hence 
V  =  /  -  F(JR0)  ~  V(RJ  ~  V(R2)  is  a  F-vector. 

Consider  the  (i—  l)th,  *th  and  (i  +  l)th  columns  (addition  mod  10).  If 
two  of  them  consist  entirely  of  O's,  then  Zt  has  a  colour-dyad  containing 
none  of  the  regions  R0,  R±  and  R2  by  (1 1 ).  But  this  is  impossible  for  it  requires 
that  two  of  these  mutually  contiguous  regions  shall  belong  to  the  same 
colour-class  of  Zt.  It  follows  that  the  F-vector  V  satisfies  (i)  and  so  by  the 
previous  result  that  V(R)  is  a  F-vector  satisfying  (i),  and  by  the  definition 
of  F',  it  follows  that  V(R0)  satisfies  (ii).  This  proves  the  lemma. 

COROLLARY.  //  three  regions  R0,  R1  and  R2  of  MP  meet  at  a  vertex,  then  the 
four  vectors  V(RQ),  V(R1)}  V(RJ,  I-~V(R0)-V(Rl)-V(R2)  are  admissible 
V -vectors  and  constitute  a  tetrad. 

If  F  is  a  F-vector,  we  denote  by  cr(  F)  the  number  of  its  1  's.  By  considering 
in>turn  the  cases  cr(F)  =  0,  o~(V)  —  1,  etc.,  we  find  that  every  F-vector 
satisfying  (i)  is  equivalent  to  a  member  of  the  following  set. 

a  =  (0000000000), 

b  =  (1000000000), 

c=  (1001000000), 

'    d=  (1000100000), 

e=  (1000010000), 

/=  (1001001000). 

Clearly  no  two  members  of  this  set  are  equivalent. 

If  x  is  one  of  the  vectors  satisfying  (i)  we  define  m(x)  as  the  smallest 
integer  m  not  0  such  that  Qmx  =  x.  We  then  have 

m(x)  =  1  if  x  is  equivalent  to  a, 
=  5  if  x  is  equivalent  to  e, 
=  10  otherwise.  (16) 


(15) 
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The  next  step  in  the  proof  is  the  determination  of  all  the  tetrads.  We  note 
that  if  the  vectors  Vi  (i  =  1,  2,  3,  4)  constitute  a  tetrad,  then 

2>(TJ)=10  (17) 

i 

by  the  definition  of  a  tetrad;  and  for  each  cr(l^)  (by  (15)) 

0«r(JJ)<3.  (18) 

The  only  sets  of  four  integers  satisfying  (17)  and  (18)  are 

(3,  3,  3,  1)     and     (3,  3,  2,  2).  (19) 

The  V-  vector  a  therefore  is  not  admissible. 

Now  the  only  vector  #  of  (15)  for  which  cr(x)  —  I  is  6  and  the  only  one 
for  which  cr(x)  =  3  is/.  Hence  by  (19)  any  tetrad  involving  b  is  of  the  form 

T  -  (b,  &f,  Qif,  Q*f). 

Now  /  contains  just  one  block  of  three  consecutive  O's.  Consider  the 
matrix  whose  rows  are  the  vectors  of  T.  The  three  columns  which  contain 
the  corresponding  block  in  Q1/  must  contain  three  1's  altogether  (definition 
of  a  tetrad)  and  this  is  possible  only  if  they  contain  the  non-zero  com- 
ponent of  b  (by  (i))  which  implies  that  i  =  1,  2,  or  3.  The  same  argument 
applies  with  i  replaced  by  j  or  Tc.  Hence  if  b  is  contained  in  a  tetrad,  that 

tetrad  is  (b,  Qf,  QV,  Q3f), 

and  it  is  easily  verified  that  these  four  vectors  do  indeed  constitute  a  tetrad. 

If  we  apply  an  operation  Qi  to  each  of  the  four  F-vectors  of  any  tetrad  T 
we  shall  clearly  obtain  a  new  tetrad.  We  denote  this  by  Q1T  and  say  that  it 
is  equivalent  to  T.  In  listing  the  tetrads  it  will  suffice  to  give  one  member 
of  each  set  of  equivalent  tetrads.  We  may  therefore  proceed  to  those  tetrads 
which  contain  no  vector  equivalent  to  b.  By  (15)  and  (19)  such  a  tetrad 
involves  just  two  vectors  equivalent  to  /,  and  is  therefore  equivalent  to  a 
tetrad  involving  a  pair/,  Q\f.  We  can  further  suppose^'  less  than  6,  for  the 
operation  Qw~i  transforms  the  above  pair  into  the  pair/,  QlQ~jf  (by  (16)). 

By  comparing  the  first  of  the  six  F-vectors/,  Qf,  Q2f,  Q3/,  <24/,  Q5f  with 
the  other  five,  we  find  that  the  cases  j  =  3  and  j  =  4  are  impossible  since 
for  each  of  these  there  is  an  s  such  that 


contrary  to  the  definition  of  a  tetrad,  but  that  the  other  cases  cannot  be 
ruled  out  in  this  way.  We  may  suppose  therefore  that^'  =  1,  2  or  5. 

Now  /-/-#/=  (0010010011) 


-  Q5c+Q»c, 
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and  these  are  the  only  two  ways  in  which  the  vector  (0010010011) 
can  be  expressed  as  the  sum  of  two  vectors  equivalent  to  members  of  the 
set  (15)  and  satisfying  cr(x)  —  2. 

Again  /-/-  Q2f  =  (0  1  0  0  1  0  0  1  0  1), 

which  can  be  expressed  subject  to  the  above  conditions  only  as 


Finally  /-/-  Q5f  =  (0  0  1  0  1  0  0  1  0  1), 

which  must  be  expressed  either  as 

Q2e  +  Q*e    or  as     Q4c  +  Q9c. 
Every  tetrad  therefore  is  equivalent  to  a  member  of  the  following  set: 

21  =  (b,  QS,  QV,  <m  ) 

T*  -  (/,  QJ,    Q*d,  Q*d 
3i-(/,  Qf,    (fc,  Q*c 


(20) 


Zi  =  (/,  Q5f,  Q*c,   Q»c). 

The  significance  of  the  set  W  of  all  admissible  F-vectors  can  best  be 
understood  in  terms  of  the  dual  map  M*  of  M  .  The  regions  of  M*  are  all 
triangles  and  so  M*  is  a  simplicial  dissection  of  the  sphere.  If  R  is  any  region 
of  M,  we  denote  the  corresponding  vertex  of  M*  by  R*.  The  dual  map 
M  J  of  MP  may  be  defined  as  the  set  of  all  simplexes  of  M*  which  do  not 
have  P*  as  a  vertex.  It  is  therefore  a  simplicial  dissection  of  a  part  of  the 
sphere  bounded  by  a  simple  closed  curve  F*  which  consists  of  the  vertices 
F$,  JF},  ...,  F$  and  the  edges  X"fF$,  F$F$,  ...,  F$  JFf  dual  to  the  five 
distinct  edges  of  MP  which  meet  the  pentagon  P.  (If  there  were  not  five  such 
distinct  edges  of  MP,  some  two  vertices  of  the  pentagon  would  be  joined 
by  an  edge  E  in  MP  and  then  at  least  one  of  the  two  regions  of  MP  incident 
with  E  would  be  incident  with  two  of  the  edges  of  P.  This  would  contradict 
the  requirement  that  the  Ft  must  be  distinct.) 

It  follows  that  the  formal  sum 

S  (W,  *7+i)  =  &  say,  (21) 

i=l 

where  (Ff,  F^+i)  is  a  1  -simplex  of  F*  with  an  orientation  given  by  the  order 
of  the  terms  Ff  and  F$+l,  is  a  bounding  1  -cycle  on  M^, 
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We  can  treat  W  as  a  simplicial  3  -complex  of  which  the  V-  vectors  are  the 
0-simplexes  and  the  tetrads  the  3-simplexes  (and  in  which  each  i  -simplex 
is  incident  with  an  (i  +  l)-simplex  for  i  <  3).  The  correspondence  It*  ->  V(E) 
defines  a  mapping  of  .M*  into  W  which  maps  vertices  on  to  vertices,  and,  by 
the  corollary  to  lemma  II,  2-simplexes  on  to  2-simplexes. 

It  follows  at  once  that  if  the  correspondence  maps  the  2-chain  U  of 
M%  on  to  the  2-chain  Uw  of  W,  where  U  and  Uw  have  ordinary  integers  as 
coefficients,  then  it  maps  the  boundary  of  U  on  to  the  boundary  of  Uw. 
Consequently  bounding  cycles  on  M%  are  mapped  on  to  bounding  cycles 
onW. 

Now  the  bounding  1  -cycle  K  on  Jff  maps  on  to  a  1  -cycle 


-  S  ( 


by  (21) 


ze, 


,  Qe)  +  (Qe, 


e,  e)) 
by  (13)  and  (16). 

But  it  can  be  shown  that  this  is  a  non-bounding  cycle  of  W.  By  proving 
this  we  shall  show  that  the  hypothesis  of  the  existence  of  a  A-  circuit  of 
10  members  leads  to  a  contradiction  and  so  establish  theorem  V.  To  do  this 
we  first  define  a  function  AfP^Pg)  for  each  1  -simplex  (Ji,  JJ)  of  W  by  means 
of  the  equations 


(22) 
(23) 


and  the  following  table. 


TABLE  I 


Reference 
number 

F! 

F2 

A(Flf  VJ 

Keference 
number 

vt 

F2 

A(Flf  Fa) 

1 

6 

Qf 

~1 

12 

d 

<?2/ 

-2 

2 

b 

Q*f 

0 

13 

d 

W 

-1 

3 

b 

Q8/ 

1 

14 

d 

W 

1 

4 

c 

Q*c 

-1 

15 

d 

W 

2 

5 

c 

Q*c 

0 

16 

e 

Q*e 

2 

6 

c 

Qf 

0 

17 

e 

Qf 

-1 

7 

c 

Q2f 

1 

18 

e 

Q*f 

1 

8 

c 

Q5/ 

J 

19 

f 

Qf 

1 

9 

c 

Q«f 

0 

20 

f 

Qzf 

2 

10 

d 

Q3d 

3 

21 

f 

Q*f 

0 

11 

d 

Q2e 

0 

It  may  readily  be  verified  (with  the  help  of  (16))  that  no  two  of  the  pairs 
of  this  table,  even  when  regarded  as  unordered,  are  equivalent  under  the 
operations  of  >Q.  The  definition  of  A(TJ,Ti)  is  therefore  consistent. 

K2 
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Other  assertions  verifiable  from  the  table  with  the  help  of  equations 
(16),  (22)  and  (23)  are:  (i)  A(Ji,Fi)  is  defined  for  every  1-simplex  (VltVz), tnat 
is  for  every  ordered  pair  of  F- vectors  both  contained  in  the  same  tetrad; 
and  (ii)  if  Vv  Vz  and  V$  are  distinct  members  of  the  same  tetrad,  then 

A(Pi,  V2)  +  A(P£,  TJ)  +  A(TJ,  PI)  =  0.  (24) 

It  is  sufficient  to  verify  these  two  assertions  for  each  of  the  tetrads  of 
(20)  by  (23).  This  verification  is  set  out  in  tabular  form  in  Table  II.  The 
numbers  in  the  last  column  of  this  table  are  the  references,  in  order,  to  the 
rows  of  Table  I. 

Equation  (24)  states  that  the  sum  of  the  function  A(Pi,PJ)  over  the 
boundary  of  any  2-simplex  of  W  is  0.  It  follows  from  this,  and  (22)  that  the 
sum  of  the  function  over  any  bounding  1 -cycle  of  W  is  0.  But  its  sum  over 
the  1-cycle  Kw  is,  by  (23)  and  the  preceding  expression  for  Kw, 

5A(e,  Q2e)  =  10     (by  Table  I). 

Thus  Kw  is  shown  to  be  non-bounding  and  the  proof  of  theorem  V  is 
complete. 

TABLE  II 


Tetrad 

FI 

l/2 

y 

A(F    V  ) 

A(F2,  V3) 

A(F    F  ) 

Sum 

References 

8 

2*1 

6 

Qf 

QZf 

-1 

1 

0 

0 

1,  19,    2 

b 

Qzf 

Q3/ 

0 

1 

—  1 

0 

2,  19,    3 

b 

Qf 

Q*f 

-1 

2 

—  1 

0 

1,  20,    3 

Qf 

Qzf 

Qsf 

1 

1 

-2 

0 

19,  19,  20 

T* 

f 

Qf 

Q*d 

1 

-2 

1 

0 

19,  15,  14 

f 

Qf 

Q*d 

1 

1 

-2 

0 

19,  13,  12 

f 

Q6d 

Q*d 

-1 

3 

-2 

0 

14,  10,  12 

Qf 

Q5d 

Q8d 

_2 

3 

-1 

0 

15,  10,  13 

T8 

f 

Qf 

Q*c 

I 

0 

-1 

0 

19,    9,    8 

/ 

Qf 

Q*c 

1 

-1 

0 

0 

19,    7,    6 

/ 

Q6c 

Q9c 

1 

-1 

0 

0 

8,    4,    6 

Qf 

Q6c 

Q*c 

0 

-1 

0 

9,    4,    7 

TI 

f 

Q*f 

Q'd 

2 

-1 

— 

0 

20,  14,  13 

f 

Q*f 

Q*e 

2 

-1 

— 

0 

20,  18,  17 

f 

Q'd 

Q*e 

1 

0 

— 

0 

13,  11,  17 

Q*f 

Q7d 

Q*e 

-1 

0 

0 

14,  11,  18 

T& 

f 

W 

Qze 

0 

-1 

0 

21,  18,  18 

f 

Q5f 

Q*e 

0 

1 

— 

0 

21,  17,  17 

f 

Q*e 

Q*e 

-1 

2 

_ 

0 

18,  16,  17 

Q"f 

Q2e 

Q*e 

—  l 

2 

-1 

0 

18,  16,  17 

!F6 

f 

Q*f 

Q*c 

0 

0 

0 

0 

21,    6,    9 

/ 

Q7 

Q9c 

0 

0 

0 

0 

21,    9,    6 

/ 

Q»c 

0 

0 

0 

0 

9,    5,    6 

Q*f 

Q*c 

Q»c 

0 

0 

0 

0 

6,    5,    9 
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The  simple  method  used  in  the  proof  of  theorem  V  will  not  suffice  to 
prove  the  analogous  theorem  for  n  —  20.  For  Erreraf  has  given  a  map  MP 
and  a  colouring  Z  such  that  a  certain  sequence  of  20  A-operations  transforms 
Z  into  itself.  Perhaps  it  is  significant  that  this  map  contains  a  region  (the 
central  one  in  Kittel's  diagram)  whose  20-vector  V  satisfies  Q\V)  =  V, 
where  Q  and  V  are  defined  by  equations  analogous  to  (14)  and  (11)  respec- 
tively. Even  in  this  case  II  (Z)  contains  colourings  of  type  I. 

Trinity  College 
Cambridge 


Errera,  loc.  cit.  and  Kittel,  loc.  cit. 
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COMBINANT  FORMS  ASSOCIATED  WITH  A  PENCIL  OF  CONICS 

By  J.  A.  TODD 
[Received  17  December  1945 — Read  17  January  1946] 

Introduction 

The  purpose  of  this  paper  is  to  determine  the  complete  irreducible 
system  of  concomitants  of  the  double  form  A\ay>  in  which  #  is  a  ternary 
variable  and  y  a  binary  variable.  If  (#0,  xlt  xz)  are  interpreted  as  coordinates 
in  a  plane,  then  the  equation  A\a,y  —  0  represents  a  pencil  of  conies,  and 
the  forms  of  the  complete  system  of  A\ay  represent  combinants  of  the 
pencil  and  form  a  complete  system  of  such  combinants.  | 

While  many  examples  of  combinants  associated  with  a  pencil  of  conies 
are  well  known,  I  am  not  aware  of  any  previous  systematic  discussion  of  them. 
It  will  appear  that  the  complete  system  to  be  considered  consists  of  65  forms. 
The  details  of  the  reduction  processes  involved  are  somewhat  lengthy,  and 
in  order  to  keep  the  paper  within  reasonable  limits  of  size  many  of  these 
have  been  left  for  verification  to  the  reader.  On  account  of  the  difficulty 
of  manipulating  complicated  symbolic  expressions,  the  reductions  have 
been  accomplished  by  making  use  of  a  simple  canonical  form;  and  their 
verification  is  a  matter  of  elementary  algebra. 

Our  method  of  procedure  is  as  follows.  We  first  construct  an  auxiliary 
system  {M }  of  concomitants  of  the  form  A%.au  which  has  the  property  that 
any  concomitant  ofA^,ay  is  a  form  belonging  to  the  simultaneous  system  of 
the  forms  of  {M},  regarded  as  binary  forms  in  y.  It  is  proved  that  such 
a  system  exists  with  eleven  members,  one  being  cubic  in  y,  two  being 
quadratic,  three  being  linear,  and  the  remaining  five  being  independent 
of  y.  The  second  step  is  the  construction  of  the  complete  system  of  the  forms 
of  {J/},  regarded  as  independent  binary  forms  in  y.  A  system  of  221  forms  is 
thus  obtained.  Next,  the  forms  of  this  system  are  reduced  to  a  set  of  65 
irreducible  forms,  the  reductions  being  effected,  in  part,  by  making  use  of 
syzygies  derived  from  the  canonical  form.  Finally,  we  add  some  remarks 


f  The  term  combinant  is  commonly  applied  only  to  such  forms  as  do  not  involve 
the  parametric  variables  y.  But  the  forms  involving  y  are  also  of  interest,  and  many 
of  them  have  simple  geometrical  interpretations. 
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of  a  geometrical  character,  interpreting  some  of  the  simpler  members  of 
the  system. 

I.   Determination  of  the  auxiliary  system  {M} 

1.  Let  S  =  Sy  =  A%.ay  =  B%by=z...=y0S()  +  ylS1  (1) 
be  the  form  under  consideration.  We  write 

and  represent  by  a',  a"  bracket  factors  of  the  form  (B'C'),  (B"C").  Then 
the  complete  simultaneous  system  of  the  two  ternary  quadratics  $0,  S^ 
consistsf  of  21  irreducible  forms,  which  we  may  take  to  be 

A?.,  A%9  A3,  A"*.;     A^  A"*,  (a'oi"x)*,  (ot,'a"x)A'a.A"a,A'xAx; 

„         n  *'  (3) 

(A'A"u)AxAx,  (oc'a"x)AaL"Axua>)  (a'ot"x)Aa,Axua»', 
( cxf OL" 3r*\  11    ?y       (A f  A  "?/ ^  A "  A '  tj  >    ( A1  A" IL\  Af » An  11 

2,  We  construct  a  system  of  forms  belonging  to  the  complete  system  of 
Axay  with  the  property  that  these  forms  (when  independent  of  y)  or  the 
coefficients  of  the  various  power  products  of  yQ  and  y±  in  these  forms,  are 
equivalent  to  the  set  (3)  in  the  sense  that  either  system  is  expressible 
rationally  and  integrally  in  terms  of  the  other.  We  shall  then  prove  that 
this  system  is  the  system  {M}  desired. 

We  write  down  the  set  of  forms  to  be  considered,  and  then  calculate  their 
coefficients  and  show  that  these  form  a  set  equivalent  to  (3).  The  numerical 
factors  introduced  are  purely  a  matter  of  convenience,  and  are  designed 
with  a  view  to  avoiding  large  numerical  coefficients  in  subsequent  work. 

Consider  the  forms  (eleven  in  number) 


^~  \(ABC]  (ABD)  (ac)  (bd)  CXDX, 


j^^(ABC)  (ABD)  (CEF)  (ac)  (bd)  (ef)DxExFx, 

£  =  &(ABC)  (ABD)  (CEu)  (DFu)  (EFu)  (ac)  (bd)  (ef). 

f  Grace  and  Young,  Algebra  of  Invariants  (Cambridge,  1903),  286. 
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We  have,  immediately, 


(5) 
Next,       y  =  %(A  Bu)  (db)  AXBX 

=  \\_(A'A"u)A'xAx-  (A"A'u)A'xA'x]  =  (A'A"u)A'xAx.         (6) 
And          3Q0  =  (A'B'A")(A'B'u)A"x-(A'B'A')(A'B'u)A'x 

=  A:.Ua,Ax-$(A"B'A')[(A"B'U)A'x-(A''A'U)B'x] 
=  A"a.<ue,AZ-l(A"B'A')[(A"B'A')ux-(B'A'u)A"x] 
=  %A"a.ua.A'x-$A'2ux. 
Proceeding  in  the  same  way  for  Q1  we  see  that 

Co-^X^I-KX.     Q^-iA'^UsA'a  +  tAZus,  (7) 

so  that  QQ  and  Q^  may  replace  the  forms  A'^u^A'^  A^u^A^  in  (3). 
Again, 

A0  =  (A'Btu)[(A'B'C')(C'D"u)D"a.--(A'B'C")(C"D'u)D/x-\. 

The  first  term  vanishes,  on  permuting  A',  B',  Cf  cyclically  and  adding,  and 
the  second  term  is  (D'C"u)  C'^u^D^  =  (A'A"u)A"a,ua'A'x.  So,  treating  At 
similarly,  we  find 

A0  =  (A'A°u)  AlrUafA'0     A!  =  -  (A'A"u)  A'a.ua.A"x.  (8) 

And       T  =  (A'B'u)  [(A'B'C')  (C'D"u)D"x-(ArB"C")  (C'D'u)!^. 

The  first  term  of  this  expression,  on  interchanging  the  equivalent  pairs  of 
symbols  A'C',  B°D",  is 

\(A'B'u)  (C'D"u)  [(A'B"G')D'x-(ArD"C')  B^ 

=  -  l(A'D"u}  (C'B"u)  [(A'B"Cf)Dx-(A'D"C1)  Bx] 

=  l[(A'B"u)  (C'D'u)  -  (A'D"u)  (C'B'u)]  [(A'B"Cf)  D"X-(A'D"G'} 

=  KA'C'u)  (B"D"u)  [(A'B"C')D'X-  (A'D"C')  B'x] 


and,  in  like  manner,  the  second  term  is  equal  to  the  same  expression.  Hence 

T  =  Ka'a'aOtt.'tV-  (9) 

Next, 

-  Qw  =  (ABC)  (ABD)  (ac)  (bd)  CXDX 

=  (A'B'C*)  (A'B'D")  CfxD'x-2(A"B'Cf)  (A"B'D")  C'XD"X 
+  (A"B"C')  (A"B"Dr)  C'XDX 


(the  second  term  being  the  sum  of  two  terms  which  are  equivalent). 
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Here 


and  similarly  T3 

while,  writing  a'  for  (£'C")  and  a"  for 


=  l(A"D"a,'x)(B'C'anx)  =  J(a'a"z)2, 
so  that  -  Qw  =  4$S0  4-  A^Sl  -  f  (aV'a)2,  (10) 

and  (ot'a"x)2  may  be  replaced  by  w  as  an  equivalent  form  in  (3). 

Next,  —  L  is  the  first  transvectant  of  w  and  S  with  respect  to  x.  Hence, 
denoting  such  a  transvectant  by  (w,  $)*,  we  have 


=  -  Ay-/  -  |(SV'  JB'w)  (x'ot'x)  B'x 

=  -  ASy-  K  B>B.  -  B;- 

The  second  term  is 


and  vanishes,  as  we  see  by  permuting  Br,  C',  D'  cyclically  and  adding. 
Thus,  treating  L±  in  the  same  way,  we  have 

so  that  LQ  and  L:  may  be  taken  as  two  of  the  irreducible  members  of  the 
system  (3)  in  place  of  (occt'x)A'cl'A'xua,  and  the  similar  form. 

Lastly,  by  expressing  j  and  £  as  transvectants  of  L  and  S  (with  respect 
to  y  as  well  as  other  variables)  it  is  easy  to  verify  that 

3.  It  follows  from  what  has  just  been  shown,  that  instead  of  the  set  (3) 
we  may  take  the  irreducible  system  of  the  two  ternary  quadratics  SQ  and 
$!  to  be  the  21  forms 

A.,0.,©,,^,^,^^,^,*,-. 

£,  «,  Q0,  ()»  y,  L9,  Ll,  A,,  Ax,  V,l  (13) 

which  are  defined  in  (4),  and  that  these  forms  are  the  coefficients  of  power 
products  of  ?/0,  y±  in  the  eleven  forms  belonging  to  the  complete  system  of  S 
defined  above. 
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Suppose,  now,  that /is  any  form  belonging  to  the  complete  system  of  S. 
Since /is  covariant  with  respect  to  linear  transformation  of  the  variables  x9 
the  coefficients  of  the  various  power  products  of  y0  and  y^  in  /  (or  the  form 
/  itself  if  these  variables  are  absent)  must  be  covariants — in  the  extended 
sense  which  includes  concomitants  of  any  type — of  the  ternary  quadratics 
S0  and  8t  and  hence,  being  rational  integral  functions  of  the  coefficients  in 
80  and  8lt  must  be  expressible  rationally  and  integrally  in  terms  of  the  set 
of  forms  (13).  Since,  further,  /  is  covariant  with  respect  to  linear  trans- 
formation of  the  variables  y,  its  expression  in  terms  of  y0)  y1  and  the  forms 
(13)  is  unaltered  (save  for  multiplication  by  a  power  of  the  determinant  of 
the  transformation)  when  yQ,  y±  are  replaced  by  independent  linear  functions 

2/o,  y(  of  them  and  SQ,  Sly  ...  are  replaced  by  the  corresponding  S'Q,  8'lt  

Thus,  by  definition  of  a  simultaneous  covariant,  /  is  a  simultaneous  co- 
variant of  the  system  u,  8,  S,  7,  A,  Q,  A,  w,  L,j,  £  regarded  as  a  set  of  binary 
forms  in  y.  If  then  we  construct  the  complete  system  of  this  set  [M]  of 
binary  forms  we  shall  obtain  the  complete  system  of  8  by  picking  out  from 
the  simultaneous  system  those  forms  which  are  irreducible  when  the  syzygies 
between  the  forms  (13)  are  taken  into  account. 


II.    Solution  of  the  binary  form  problem 
4.   Regarded  as  binary  forms  in  y  the  forms  of  {M }  fall  into  four  groups. 

(i)  A,  a  cubic  in  y, 

(ii)  £,  A,  quadratics  in  y, 

(iii)  8,  Q,  L,  linear  in  y, 

(iv)  u,  7,  w,  j,  f  which  do  not  involve  y. 

In  forming  the  complete  simultaneous  system  of  {M}  with  respect  to  y 
we  may  ignore  the  last  five  forms,  provided  they  are  included  finally  in  the 
complete  system.  We  have  thus  to  consider  the  complete  simultaneous 
system  of  one  cubic,  two  quadratic,  and  three  linear  forms,  and  in  the 
present  section  we  shall  ignore  any  special  relations  between  these  forms 
which  hold  in  virtue  of  syzygies  connecting  the  forms  (13). 

While  the  complete  system  we  obtain  contains  a  large  number  of  forms, 
its  actual  determination  is  quite  simple,  particularly  as  we  have  no  need 
to  establish  its  actual  irreducibility,  since  the  whole  system  has  in  any 
case  to  be  reconsidered  in  view  of  the  special  relations  connecting  the 
forms, 

It  will  be  convenient  to  denote  by  qr  a  product  of  degree  r  each  of  whose 
factors  is  one  of  the  quadratics  £,  A,  and  similarly  by  lr  a  product  of  degree 
r  whose  factors  are  8,  Q,  L. 
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5  .   We  first  form  the  simultaneous  system  of  A  ,  21  ,  A  .  The  complete  system 
of  A  consists  of  the  four  forms 


A,     #  =  (A,A)2,     T  =  (H,&)\    D  =  (H,H)Z,  (14) 

and  the  simultaneous  system  {B^  of  S  and  A  of  six  forms 

S,     A,     r^CS.A)1,     (S,2)2,     (2,  A)2,     (A,  A)2.  (15) 

The  last  three  forms,  and  the  form  D  of  {A-^,  do  not  involve  y. 

The  complete  system  of  A,  S,  A  is  that  derived  by  trans  vection  from  {A-^ 
and  {-#j}.  Considerations  of  the  orders  of  the  forms  involved  and  the  fact 
that  T  and  F  are  jacobians  showf  that  the  only  trans  vectants  which  need 
be  considered  are 


(A,r)a,         ,,          ,,          ), 


Of  this  set  the  ones  in  the  last  line,  involving  both  T  and  F,  are  all  reducible. 
For,  representing  terms  in  a  transvectant  by  their  bracket  factors, 

(T,  F)2  =  ((#A)  Hy  A2  ,  (SA)  2^  A^)2 
contains  the  term 

(J?A)  (ETL)  (AA)  (SA)  =  (FA)  (SA)  .  (HI,)  (AA) 

=  \[(Hb)*  (2A)2  +  (H2)2  (AA)2  -  (#A)2  (AS)2], 

and  so  is  reducible.  Moreover,  being  a  linear  form,  its  expression  in  terms 
of  other  forms  of  the  system  is  a  sum  of  terms  each  having  an  invariant 
factor.  J  Hence  any  symbolic  product  containing  the  factor  (2T)2  is  re- 
ducible. Since  (T^F)3  and  (AT,g2F)6  contain  such  terms  it  follows  that 
these  transvectants  are  also  reducible. 

Thus  the  system  {A^  derived  by  transvection  from  {A^}  and  {J5X}  consists 
of  41  forms,  as  follows  (in  which  qr  gives  rise  to  r  +  1  forms): 

4  cubics:  A,   T,  (A,^)1; 

6  quadratics:        H,     S,     A,      F,     (H,q±)l'y 


13  linear  forms:     (A,  ft)*,     (!T,ft)a,     (A,  F)2,     (A,</2)3,     (T,q^, 

(A,ftD»; 
18  invariants:        Dt    (S,S)2,    (2,  A)2,    (A,  A)2,    (#,ft)a,    (//,  F)2, 


(17) 


•f  Cf.  Grace  and  Young,  loc.  cit.  ante,  164—165  for  a  discussion  of  the  very  similar 
problem  of  a  cubic  and  one  quadratic. 
J  I.e.  a  factor  not  involving  y. 
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6.   The  complete  system  {B^  of  the  three  linear  forms  S,  Qt  L  consists 
of  the  six  forms 

S,     Q,     L,     (S,Q)\     (S,L)\     (Q,L)\  (18) 

the  last  three  being  invariants.  The  complete  system  of  A,  S,  A,  S,  Q,  L  is  that 
derived  by  transvection  from  {A^  and  {B2}.  Since  all  the  forms  of  {J52}  in- 
volving y  are  linear,  the  complete  system  consists  of  the  forms  of  {A%}  and 
{J52}  together  with  transvectants  (/,  Zr)r,  where/  is  one  of  the  23  forms  of  {A2} 
of  positive  order,  and  r  does  not  exceed  this  order.  From  each  linear  form 
in  {A  2}  we  thus  obtain  three  invariants,  from  each  quadratic  three  linear 
forms  and  six  invariants,  and  from  each  cubic  three  quadratics,  six  linear 
forms  and  ten  invariants.  The  complete  system  derived  from  {A2}  and  {B2} 
thus  contains  4  cubics,  6  +  4.3  =  18  quadratics,  (13  +  3)  +  4.  6  +  6.  3  =  58 
linear  forms  and  (18  +  3)  +  4.  10  +  6.  6  +  13.3  =  136  invariants.  Adding  to 
these  the  five  forms  u,  y,  w,  j,  £  we  have  in  all  221  forms  to  consider;  and  the 
complete  system  of  S  consists  of  such  of  these  forms  as  are  irreducible. 


III.    The  complete  irreducible  system  of  S 

7.  We  now  approach  the  most  complicated  stage  of  our  problem.  The 
system  of  221  forms  which  we  have  just  obtained  includes  all  the  irreducible 
concomitants  of  S,  but  owing  to  the  existence  of  syzygies  between  the 
coefficients  in  the  forms  of  the  set  {M}  many  of  these  221  forms  prove  to  be 
reducible. 

The  first  step  towards  a  systematic  examination  of  these  forms  is  to 
arrange  them  in  ascending  order  of  degree  in  the  coefficients  of  S\  the  process 
of  testing  for  reducibility  proceeding  from  one  degree  to  the  next.  Un- 
fortunately the  number  of  forms  involved  is  so  large  as  to  make  this  list 
inconvenient  to  print.  We  shall  therefore  arrange  the  work  in  the  following 
manner.  First,  we  give  a  list  of  the  65  forms  which  are  finally  shown  to 
constitute  the  irreducible  system  of  S.  Next,  after  establishing  a  simple 
canonical  form  for  S  and  its  more  important  concomitants,  we  shall  show 
that  the  156  forms  not  included  in  the  list  are  reducible.  Finally,  we  shall 
indicate  how  the  actual  irreducibility  of  the  set  of  65  forms  retained  may 
be  established. 

8.  In  the  list  of  forms  which  follows,  the  symbol  [m,  ri\  affixed  to  a  form 
indicates  the  orders  of  this  form  in  x  and  u.  Forms  of  different  orders  in  y 
are  shown  separately,  and  the  arrangement  of  forms  of  given  order  in  y  is 
in  order  of  increasing  degree.  The  65  irreducible  forms  may  be  taken  as 
follows. 
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(i)  Forms  independent  of  y  (27  forms). 
Degree 

0  «*[!,!]; 

2  y[2,l]; 

4  (£,£)2[0,4],     11;  [2,0],     (S.WtS,!]; 

6  £[0,3],     «A,£)»,S)l[2,2],    j[3,0],     (2,  SQ)a  [3,3],     (A,S»)»  [6,0]; 

8  (H,  2)2  [0,  2],     ((A,  S)2,  Q)1  [1,  3],     (Q,  i)1  [3,  2],     (H,  £2)2  [4, 0]; 

10  (H,  A)2  [1,  2],     ((A,  S2)3,  Q)1  [1,  5],     (H,  SQ)*  [3, 1],     (A,  SQ2)3  [4, 2]; 

12  D[0,0],     (H,r)2[l,4],     (H,  Q2)2  [2,  2],     (/f,  S£)2  [4, 1]; 
14     ((r,£)»,Q)i[l,3],     ((H.S)1,^)"  [2,4],     (T,S2£)3  [6,1]; 
16     ((^SJMiJHa.S]; 

18     (AT,  S3)6  [0,6]. 

(ii)  Forms  linear  in  y  (21  forms) 

Degree 

1  S[2,0]; 

3  #[1,1],      (S,^)1^]; 

5  (A,S)2[0,2],     (S.Q^tl.S],     J&[2,1],     (A,  S2)2  [4,0]; 

7  (A,  S2)3  [0,  4],     (A,  A)2  [1,  2],     (H,  S)1  [2, 0],     (A,  SQ)Z  [3, 1]; 

9  (£T,#)i  [1,1],     (A,SA)3  [1,4],     (A,Qa)8[2,2]j 

11  (T,S)2[0,2],     ((H.S)1^)1  [1,3],     (H,  i)1  [2, 1],     (T,S2)2  [4,0]; 

13  (T,S2)3  [0,4],     (T,A)2[1,2],     (T,SQ)*  [3, 1]. 

(iii)  Forms  quadratic  in  y  (13  forms) 
Degree 

2  £[0,2]; 

4  A  [1,2],     (A,£)1  [2,0]; 

6  #[0,0],     (A,W  [1,1],     T[l,4]; 

8  (H,^)1  [0,2],     ((A.S)1,  Q)1  [1,3],     (A, I,)1  [2, 1]; 
10     (^A)1  [1,2],     (T,^)1  [2,0]; 

12  (T,g)i  [1,1]; 

14  (Tji)1^,!]. 

(iv)  Forms  cubic  in  y  (4  forms) 
Degree 

3  A  [0,0]; 

5  (A,  £)*[0,2]; 

7  (A.AJMl^]; 

9  T  [0, 0]. 

9.  Further  discussion  of  the  system  is  simplified  by  the  fact  that  a  very 
simple  canonical  form  exists  for  which  the  expressions  of  all  these  con- 
comitants are  quite  concise.  To  obtain  such  a  form  we  observe,  in  the  first 
place,  that  the  coordinate  system  in  the  plane  can  be  chosen  so  that  the 
four  base-points  of  the  pencil  of  conies  (assumed  to  be  of  general  type) 
have  coordinates  ( ±  1,  ±  1,  ±1).  The  equation  of  every  conic  of  the  pencil 
is  then  of  the  form  a0ajo  +  aiaji  +  a*x*  —  ®  ^^  ao  +  ai  +  «2  =  °-  Next,  by  a 


158  J.  A.  TODD  [17  Jan. 

suitable  linear  transformation  of  y0  and  yl9  we  can  reduce  the  cubic  A  to  a 
multiple  of  y%  +  y\.  This  amounts  to  selecting  as  the  two  conies  $0  and  St 
which  define  the  pencil  the  (unique)  pair  of  conies  of  the  pencil  which  are 
mutually  apolar.  The  equations  of  these  conies  are  x%+a)x\  +  a)zx%  =  0  and 
xl+(i)2x\ +a)x%  =  0,  where  a)  is  a  primitive  cube  root  of  unity.  We  may 
therefore  take,  as  the  canonical  form  of  S,  the  expression 

S^a[yQ(xl  +  ^xl  +  ^xl)+yi(xl  +  ^xl  +  ^xl)}.  (19) 

The  introduction  of  the  factor  a  in  (19)  is  convenient,  since  it  affords  an 
automatic  check  on  the  degrees  of  the  forms  involved  in  any  syzygy,  all 
forms  of  degree  r  having  ar  as  a  factor. 

We  now  introduce  the  following  convention.  Let  /0  be  a  polynomial  in 
XQ)  xl9  x2,  u0,  %,  u2\  and  let/x  and/2  be  the  expressions  derived  from/0  by 
replacing  the  suffixes  (0,1,2)  by  (1,2,0)  and  (0,2,1)  respectively.  We 
shall  write 

^/2     (r  =  0,l,2).  (20) 


It  will  be  observed  that  if/0  is  unaltered  by  cyclic  permutation  of  the  suffixes 
(0, 1,  2)  then  /0  =/,  =  /2,  and  [/0]0  =  3/0,  [f^  -  [/0]2  =  0.  We  shall  also 
use  the  abbreviation 

__  .  >2 ()      e2  = 3  (21) 

The  calculation  of  the  explicit  expressions  for  the  65  forms  listed  in  §  8 
now  presents  no  trouble.  The  eleven  forms  of  {M}  are  calculated  directly, 
and  the  rest  follow  by  forming  the  appropriate  trans vectants.  We  give  the 
complete  list  of  forms  for  reference;  in  forming  this  list  we  have  occasionally 
replaced  certain  of  the  forms  of  the  last  section  by  equivalent  forms  when 
these  have  a  simpler  canonical  expression.  The  complete  list  of  forms  is  as 
follows;  the  order  of  arrangement  is  that  of  §  8. 

1.  u  =  [w0#0]0. 

2.  7  =  eaz['M0x1a:2]0. 

4.  w'  =--a*[x%]0. 

5.  (S,  Q)1  =  —  3a4[w0#0]0  —  wu. 

9.     (£,  SQ)Z          =  3eae[-u0c 
11.     (H ,  2)2  =  2a8[w2] 

14.     (H,S*)*  =~2a8[^ 

16.     ((A, S3)3,  Q)1   =  —  6eaw[uQxQ(ul  —  u\)(u\  +  u\  — 
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17.  (H,SQ)*         =  -ea™\u0xQ(x\-xl}\. 

18.  (A,S<?2)3         =  3ea1Q[u*x*(xl-xl)]0  +  2(H,SQ)*u. 

19.  D  =  -2a12. 

20.  (H,  T)2  =  ~  l2alz[ulUzx0(ul-uz2)-]Q. 

21.  (#,Qa)2 

22.  (H,SL)* 

23.  ((T,IC)2,g)i 

24.  «H,E)\Q2)2 

25.  (T,^g)« 

26.  ((T,£)2,£)i 

27.  (A^S3)6 

28.  S 

29.  g 

30.  (S,^)1 


31.  (A,S)2 

32.  (E,  g)1  -  -  3a»(y0[Ml  w,(a^t«a  +  ^a  ^JJ!  + 

33.  L  =  ea»(i/0[w0aj1ica]1-2/1[w0a;1a;t]2). 

34.  (A,  ^2)2 

35.  (A^S2)3 

36.  (A,  A)2 

37.  (H,^)1 

38.  (A,^Q)2 

39.  (H,  Q)1 

40.  (A5SA)3 

41.  (A,  £2)2 

42.  (T,L)2 

43.  ((H,S)Sg)i 

44.  (//,  iv)1  =  —  ea11(2/0[w0a;1a;a]1  +yi[w0a;lara]a) 

45.  (  T,  S2)2 

46.  (T,S2)3 

47.  (T,A)2 

48.  (T,SQ)* 

49.  S 

50.  A  =  ~2ea4(i/ 

51.  (A,^)1  =a%o[^o2] 

52.  H  =  2a«2/02/!. 
63.  (A,  Q)1              =  -  a«(7/2[ 
54.  T 


55.  (H.S)1 

60.  ((A^)1, 

57.  (A,  I^)1 

68.  <H,A)i 

59.  (T,  S)1 

60.  (T,g)i 

61.  (T,^)1 

62.  A 

63.  (A,^)1 

64.  (A,  A)1 

65.  T 


160  J.  A.  TODD  [17  Jan. 

10.  We  shall  now  show  that  the  remaining  156  forms  of  the  system  can 
be  expressed  in  terms  of  the  65  which  we  have  obtained  explicitly  in  §  9. 
In  the  case  of  forty  of  these  forms  we  do  this  by  writing  down  explicitly  the 
expression  in  terms  of  forms  of  the  irreducible  system.  Many  of  these 
syzygies  were  somewhat  difficult  to  discover;  the  procedure  being  to  write 
down  all  possible  products  of  forms  of  the  irreducible  system  of  the  correct 
degree  and  orders  in  the  variables,  and  to  see  if  the  form  it  is  proposed  to 
reduce  is  linearly  expressible  in  terms  of  these  by  using  the  canonical  form. 
Once  obtained,  however,  the  verification  of  the  syzygies  is  elementary 
algebra  (though  in  the  case  of  the  more  complicated  relations  the  calculation 
is  somewhat  tedious).  It  is  therefore  unnecessary  to  do  more  than  write  the 
syzygies  down,  and  this  is  done  below  in  equations  numbered  (22)-(61). 
We  shall  show  then  that  the  reducibility  of  the  other  116  forms  is  a  con- 
sequence of  that  of  the  forty  for  which  the  syzygies  are  given.  The  forms 
reduced  by  syzygy  comprise  one  quadratic  [equation  (22)],  sixteen  linear 
forms  [equations  (23)-(38)],  and  twenty  -three  forms  independent  of  y 
[equations  (39)-(61)].  They  are  as  follows: 

((A,  S)1,  A')1  =  *  (  A,  2)2  8  -  \wZ  -Q2  +  2(  A,  Q)i  u  -I-  %Hu*,  (22) 

(A,  tf)1--  2Lu  +  2Qy9  (23) 

(S,  LY  =  -  *(A,  S)2y  +  f  (A,  A)2  u  -  3£S,  (24) 

(25) 
(26) 

)*u-2(&9Q*)*9       (27) 
(A,£)i=(A,A)2y,  (28) 

(A,  8L)*  =  -  (H,  SYy-Lw-  3jQ.  (29) 

The  form  (A,  SP)8  can  be  replaced  by  its  term  (AF)2  (AS)  Sy  =  ((A,  F)2,  S)1, 
and  hence,  using  (26),  by  ((A.SA^S)1.  And 

((A,SA)»,S)i  =  -|(S,S)2(A,A)2-6C(S,  Q}\  (30) 

(A,  A2)3  =12£L.  (31) 

The  form  ((A,  S)1  Q2)2  can  be  replaced  by  its  term 


and        ((A,  Q2)2,  S)i  =  i(S,  S)2  (H,  S)1  -  f(#,  S)2  (S,  /S)1  +  (H,  E)2  Qu 

+  2((H,  S)1,  Q)1  u  -  i«?(  A,  S2)3,  (32) 

(A,  QL)2  =  i(  A,  A)2  w  -  i(#,  A)2  8  -  (H,  Q?  y.  (33) 
The  form  (A,  AF)3  can  be  replaced  by  its  term  (AF)2  (AA)  Av  =  ((A,  T)2,  A)1, 
and  hence,  using  (26),  by  ((A,SA)3,  A)1.  And 

((A,SA)a,A)1  =  3(A,A)2^-9(A,E)2^7-18g2ASf,  (34) 

(A,  L2)*  =  -  (H9  LY  y  -  f  (A,  A)2  j.  (35) 
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The  form  (T,S  A)3  may  be  replaced  by  its  term  (TS)2(TA)  Ay  =  (( 
and 


((T,S)2,A)i  =  (A,  A)*  (^,2)*+  12(^,0^,  (36) 

(T^2)2  =  l(H,V)*(H,SF-k(T,X)*w  +  ±D(X,S)*-DQu,      (37) 

(T,  A2)3  =  (Ht  A)2  (A,  A)2  -  1  2(H,  L)1  £  ,  (38) 

(S,£2)2  =  -4(£,e)iw-2y2  +  2^2,  (39) 

(2,A)a=12£tt,  (40) 

(£,  jD)1  =  -Zju-wy,  (41) 

(A,  A)2  =  24f7j  (42) 

(S,Q2)2  =  -((A,S)2,e)i^-(^,S)2w2+6?r,  (43) 

((A,  S2)3,  ^8f)i  =  6((A,  S)2,  C)i  w  +  3(H,  2)2  u*  -  12gy  -  (S,  S)2  w,         (44) 

((A,A)2,^)^2(g,L)i,  (45) 

(A,£2<?)3  =  i^fif^w  +  w^-^Oiw+Sjy,  (46) 

(47) 
(48) 
(49) 
(50) 

(A,  ^3f  =  _  f  (#,  «2)2^-  JZ>^_  9jg,  (51) 

(52) 
(53) 


The  form  (A2,  S2A)6  may  be  replaced  by  its  term  (AH)2(AS/)(A/S')(A/A)2 
which  is  equal  to  ((A,  A)2,  (A,  22)3)1.  And 


)2,e)H-6(H,2;)2^,        (54) 
,  E2)3,  S)1  =  -  i(U  ,  S)2  ((A,  S)2,  fif)i  +  2((H,  S)i,  Q2)2 

M,  (55) 

(56) 


)2,  0)i  (IT,  2)2 

)2]2  tt  -  JI>(S,  S)2  «  +  3(  jy  ,  A)2  f,  (58) 

2,^i)  (69) 

2)2  (jy  ,  2  )2  -  (H,  S)2  w?a  -  (^,  Q*)*W 

-D(S9  Q^u  +  Dwu*,  (60) 


)2,^^^ 

(61) 
SEB.  2.    VOL.  50.    NO.  2389.  r 
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H.  We  shall  now  show  how  all  the  221  forms  are  accounted  for.  The 
four  cubic  forms  in  y  are  irreducible.  Of  the  eighteen  quadratics,  thirteen 
are  included  hi  the  irreducible  system,  and  one  is  reduced  by  (22).  The  other 
four  are  ((A,  S)1,  L)1  and  ((A,  A)1,  l)\  where  liaS>Q  or  L. 

Now 


But,  by  (24),  (S,  L)1  is  expressed  as  a  sum  of  products  of  forms,  each  product 
containing  a  factor  independent  of  y.  Hence  (A,  (S,  L)1)1  is  expressible  in 
terms  of  products  of  forms  of  lower  degree  or  order,  and  is  reducible.  Thus 
((A,  S)1,  L)1  is  reducible.  Similarly,  since  the  forms  (A,  I)1  are  reducible  by 
(23),  (25),  (28),  the  three  forms  ((A,  A)1,^1  are  reducible. 

Thus  the  five  quadratics  not  included  in  the  irreducible  set  are  all 
accounted  for. 

The  58  linear  forms  comprise 

(a)  the  13  forms  in  (17)  together  with  8,  Q,  L, 

(b)  18  forms  (#',/)*,  where  q'  is  one  of  the  six  quadratics  in  (17)  and  I  is 
S,  Q  or  L, 

(c)  24  forms  (c,  Z2)2,  where  c  is  one  of  the  four  cubics  and  lz  is  a  product  of 
two  of  S,  G,  L. 

Of  the  16  forms  (a),  ten  are  included  in  the  irreducible  system.  The  other 
six  are  reduced  by  (26),  (30),  (31),  (34),  (36),  (38). 

Consider  next  the  forms  (6).  Those  in  the  irreducible  system  are 


The  transvectants  (S,  L)1,  ((#,  Z)1,  £)1  are  reduced  by  (24)  and  (27).  The 
transvectant  ((H,  L)1,  L)1  may  be  replaced  by  its  term 


and  hence  reduced  by  (24).  The  forms  (A,  I)1  are  reduced  by  (23),  (25),  (28). 
And  the  forms  ((H,  A)1,  1)1  may  be  replaced  by  (H,  (A,  ^)1)1and  are  accordingly 
reducible.  Lastly,  (I\Z)1  contains  the  term  (SA)(AZ)£y  =  (S,  (A  J)1)1  and 
is  accordingly  reducible.  Thus  all  the  forms  (b)  are  accounted  for. 

Consider  next  the  forms  (c).  Of  the  six  forms  (A,  12)2  the  three  in  which  12 
contains  a  factor  L  are  reduced  by  (29),  (33),  (35).  Moreover,  (T,12)2 
contains  the  term  (H&)(kl2)2Hy  =  (H,  (A,  y2)1  and  hence  is  reducible  if 
(A,  J2)2  is.  Thus  (T,  Z2)2  is  reducible  if  lz  contains  the  factor  L,  and  (T,  Q2)2 
is  reduced  by  (37).  Since  ((A^)1,?)1  is  reducible,  so  also  is  ((A,  A)1,^)2. 
Similarly,  since  ((A,  S)1,  Z)1  is  reducible  if  I  is  S  or  L,  the  only  transvectant 
((A,  S)1,  Z2)2  which  remains  for  consideration  is  that  in  which  lz  —  Q2,  and 
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this  is  reduced  by  (32).  Thus  nineteen  forms  (c)  reduce;  the  other  five  are 
listed^in  the  irreducible  system. 

The  58  linear  forms  are  thus  accounted  for. 

Consider  next  the  remaining  141  forms,  independent  of?/.  We  may  arrange 
these  in  the  following  groups  : 

(a)  The  five  forms  u,  y,  w,  j,  £. 

(6)  The  18  invariants  of  the  set  (17). 

(c)  The  three  forms  (8,  Q)\  (S,  L)1,  (Q,  L)1. 

(d)  39  forms  (l',l)1,  where  I'  is  one  of  the  13  linear  forms  (17)  and  I  is 
S,  Q,  L. 

(e)  36  forms  (q'  ',  Z2)2,  where  q'  is  one  of  the  six  quadratics  (17)  and  lz  is  a 
product  of  two  forms  S,  Q,  L. 

(/)  40  forms  (c,  Z3)3,  where  c  is  one  of  the  four  cubics  (17)  and  13  is  a  power 
product  of  S,  Q,  L  of  degree  3. 

The  five  forms  (a)  are  included  in  our  irreducible  system. 

Of  the  forms  (6),  the  irreducible  system  includes  D,  (£,S)2,  (tf,E)2, 
(tf,A)2,  (H,  F)2  and  (AT,£3)6,  six  forms  in  all.  The  four  forms  (S,A)2, 
(A,  A)2,  (A2,S3)6,  (A2,S2A)6  are  reduced  by  (40),  (42),  (49),  (54).  The  re- 
maining eight  forms  are  (A2,  g^A2)6,  (A2,#2r)6,  (AT,  <72A)6,  where  qi  is  a 
homogeneous  product  of  degree  i  in  S  and  A  ;  and  these  contain,  respectively, 
terms  with  the  symbolic  factors  (AA)2(AA')5  (AF1)2,  and  either  (TA)2(TZ) 
or  (TA)2(TA').  But  (A,  A2)3,  (A,  T)2,  (T,SA)3,  (T^A2)3  are  reducible  by 
(31),  (26),  (36),  (38).  Hence  all  the  forms  in  question  are  reducible.  The 
forms  (8,  Q)1,  (Q,  L)1  are  irreducible,  but  (8,  L)1  is  reduced  by  (41). 

In  (d),  we  need  only  consider  the  cases  (seven  in  number)  in  which  I'  is 
irreducible,  i.e.  those  for  which  I'  is  one  of  the  set 

(A,£)2,  (A,E2)3,  (A,  A)2,  (A,SA)3,  (T,£)2,  (T,S2)3,  (T,A)2.    (62) 

Since  (A,?)1  is  reducible  and  (A,  SA)3  contains  the  term  (AE)2(AA)  Ay, 
the  three  forms  ((A.SAJM)1  are  reducible.  Similarly,  ((A,  £2)3,  i)1  and 
((T,S2)3,  L)1  are  reducible  since  (S,  L)1  is  reducible.  Other  forms  (L',1)1 
with  V  in  the  set  (62)  are  reduced  by  (44),  (45),  (47),  (48),  (50),  (52),  (55), 
(57),  (58),  (59),  (61).  Thus,  of  the  21  forms  arising  from  (62),  16  have  been 
reduced.  The  remaining  five,  namely, 


appear  in  our  list  of  irreducible  forms. 

Consider  next  the  36  forms  (e).  Of  the  six  forms  (£M2)2,  (Ht  QL)*  and 
(H,  SL)2  are  reduced  by  (56)  and  (60).  The  other  four  appear  in  the  irre- 
ducible list.  Since  (S,  L)1  is  reducible  we  need  only  consider,  among  the 
forms  (2,  Z2)  2,  those  in  which  lz  does  not  contain  a  factor  L.  Of  these  forms 

L2 
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(S,  S2)2  and  (S,  Q2)2  are  reduced  by  (39)  and  (43).  The  remaining  form 
(£,  SQ)2  is  in  our  irreducible  list.  Since  all  the  forms  (q',  I)1  with  q'  =  A,  F 
or  (H,  A)1  have  been  reduced,  all  the  corresponding  forms  (q',l2)2  are  reducible. 
Finally,  since  the  only  irreducible  trans  vectant  ((H,  S)1,  1)1  is  that  for  which 
I  —  Q,  the  only  form  ((J?,  S)1,  Z2)2  to  be  considered  is  that  for  which  12  =  Q2, 
and  this  in  fact  appears  in  our  irreducible  list.  Thus  all  the  forms  (e)  have 
been  considered. 

Lastly,  consider  the  40  forms  (/).  Since  the  only  irreducible  forms  (c,l2)2 
have  been  shown  to  be 

(A,£2)2,     (&,SQ)2,     (A,Q2)2,     (T,S2)2,     (T,SQ)2, 
all  the  forms  (c,  /3)3  are  reducible  with  the  possible  exception  of 


where  ^  is  a  product  of  degree  i  in  S  and  Q  only.  Of  these  six  forms  (A, 
(A,Q3)3  and  (T,  £3)3  are  reduced  by  (46),  (51),  (53).  The  other  three  are 
included  in  the  list  of  irreducible  forms. 

The  above  examination  shows  that  all  the  221  forms  have  been  considered, 
and  that  accordingly  the  system  of  65  forms  listed  is  complete. 

12.  Attempts  to  express  forms  of  the  complete  system  in  terms  of 
products,  of  the  same  degree  and  orders  in  the  variables,  of  other  forms  of 
the  system  having  been  unsuccessful,  it  is  believed  that  the  sixty-five 
forms  enumerated  above  constitute  the  complete  irreducible  system.  An 
example  will  illustrate  the  procedure.  We  shall  prove  that  (A,  SA)3  cannot 
be  expressed  in  terms  of  the  other  forms  of  the  system.  (The  irreducibility 
of  this  particular  form  was  in  fact  doubted,  since  alone  among  the  linear 
forms  in  y  it  is  not  accompanied  by  a  correlative  form  in  which  the  variables 
x  and  u  are  interchanged.)  The  form  in  question  is  of  degree  9,  and  of  orders 
1,  1,  4  in  y,  x,  u  respectively.  We  select  from  the  list  of  forms  those  of  degree 
r  and  orders  n1}  nz,  n3,  where  r  ^  9,  n^  ^  1,  nz  ^  I,  n3  ^  4.  Denoting  such  a  form 
/  by  the  symbol  /(r;  n^  n2,  %)  we  have 

u(0;  0,1,1),     Q(3;  1,1,1),     (S,S)2(4;  0,0,4),     (A,S)2(5;  1,0,2), 
(S,^)1  (5;  1,1,  3),     £(6;  0,0,  3),     (A,  S2)3  (7;  1,0,  4),     (A,  A)2  (7;  1,  1,  2), 
(#,Z)2(8;0,0,2),     ((A,  S)2,  OMB;  0,1,  3),     (JI,g)i(9;  1,1,  1). 

The  only  products  of  these  forms  which  gives  a  form  (9;  1,1,4)  is  £Q. 
Reference  to  the  list  of  standard  forms  shows  that  (A,  2  A)3  is  not  a  multiple 
of  £Q.  Hence  it  is  irreducible.  The  other  forms  have  been  tested  in  a  similar 
way,  and  found  to  be  irreducible. 
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IV.    Geometrical  interpretation  of  the  system 

13.  We  shall  conclude  by  interpreting  some  of  the  simpler  forms  of  the 
system  geometrically.  The  geometrical  significance  of  some  of  the  more 
complex  forms  of  the  system  is  somewhat  obscure.  Moreover,  the  irreduci- 
bility  of  a  form  is  frequently  of  little  geometrical  significance.  For  instance, 
the  form  (A,  A)1  is  certainly  irreducible.  On  the  other  hand,  the  same  form 
multiplied  by  the  invariant  D  is  reducible,  since,  as  may  be  verified  at  once 
from  the  table  of  canonical  forms, 

.D(A,A)1  =  -(H,A)*T  +  (T,  A)2#.  (63) 

An  investigation  in  detail  of  the  geometrical  meaning  of  all  the  forms  in- 
volved would  thus  not  only  be  lengthy  but  pointless.  We  shall  content 
ourselves  with  identifying  a  few  of  the  simpler  members  of  the  system  with 
well-known  geometrical  loci  associated  with  the  pencil  of  conies;  and  (in 
particular)  showing  that  all  the  forms  of  our  system,  involving  a  single 
variable  only,  admit  of  simple  geometrical  interpretations.  The  proofs  of 
most  of  the  statements  made  below  are  left  to  the  reader;  they  arise  almost 
immediately  by  considering  the  canonical  forms. 

14.  The  equation  8  =  0  represents,  for  each  value  of  (f/0>2/i)>  a  conic  of 
the  pencil.  The  tangential  equation  of  this  conic  is  S  =  0.  This  is  a  binary 
quadratic  in  «/0,  yl9  and  the  discriminant  of  this  quadratic,  (£,S)2,  is  a 
quartic  in  (u0,  u^  u2)  which  when  equated  to  zero  is  easily  recognized  as  the 
tangential  equation  of  the  set  of  base-points  of  the  pencil.  The  form  A  is  the 
discriminant  of  S,  and  the  equation  A  =  0  determines  the  three  values  of 
2/o :  2/i  f°r  which  the  conic  becomes  a  line-pair.  The  form  (A,  $3)3,  which  is 
a  sextic  in  (xQ9xl9x2)9  is  easily  verified  to  be  the  product  of  the  left-hand 
members  of  the  equations  of  the  three  line-pairs  in  the  pencil. 

The  canonical  form  of  the  hessian  H  of  A  is  a  multiple  ofy0ylf  so  that  the 
two  conies  whose  parameters  satisfy  H  =  0  are  $0,  S±  and  are  thus  given  by 
(H,  S2)2  =  0.  These  two  conies  are  characterized  by  the  property  that  they 
are  mutually  apolar,  and  the  four  base-points  of  the  pencil  form  an 
equianharmonic  set  on  either  of  them.  The  equation  w  =  0  is  the  harmonic 
locus  of  these  two  conies,  and  is  a  well-known  combinant  of  the  pencil;  in 
the  canonical  form  under  discussion  the  equation  of  w  is  x%  +  x\  -f  x\  =  0,  so 
that  its  tangential  equation  is  u%  +  u\  +  u\  —  0,  which  is  at  once  identified 
with  (#,£)2  =  0. 

The  cubic  covariant  T  of  A  determines  the  parameters  of  three  other 
conies  of  the  pencil.  It  is  easy  to  verify  that  these  conies  have  double  contact 
with  w\  the  chords  of  contact  forming  the  sides  of  the  common  self -polar 
triangle  of  the  pencil.  They  also  have,  as  is  well  known,  the  property  that 
the  base  points  of  the  pencil  form  a  harmonic  set  of  any  one  of  them.  The 
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equations  of  these  conies  are  given  by  (jP,  $3)3  =  0,  a  form  which  has  been 
seen  to  be  reducible. 

The  single  invariant  D  is  the  discriminant  of  A,  and  its  vanishing  expresses 
the  condition  that  the  base  points  of  the  pencil  should  not  all  be  distinct. 

The  forms  j  and  f  are  easily  identified,  j  —  0  being  the  equation  of  the 
sides  of  the  common  self-polar  triangle  and  £  =  0  the  tangential  equation 
of  its  three  vertices. 

The  only  form  not  already  mentioned  which  includes  only  a  single  variable 
is  (A2,  S3)6,  a  sextic  in  u.  A  glance  at  the  list  of  canonical  forms  shows  that 
(A2,  S3)6  =  0  is  the  tangential  equation  of  the  six  points  whose  coordinates 
are  permutations  of  (1,  —1,  0);  and  these  points  are  the  intersections  of 
the  sides  of  the  self -polar  triangle  with  joins  of  pairs  of  the  base-points. 

15.  It  will  be  noticed  that  in  a  large  number  of  cases  to  each  form  in  the 
system  there  corresponds  another  form  whose  canonical  expression  (apart 
from  powers  of  a  and  numerical  multipliers)  is  obtained  from  that  of  the 
given  form  by  interchanging  (XQ,  x^  xz)  and  (u0,  u^  u2).  Two  such  forms  are 
evidently  polar  reciprocals  with  respect  to  the  combinant  conic  w.  It  does 
not,  of  course,  follow  that  the  reciprocal  of  an  irreducible  form  is  necessarily 
irreducible,  and  there  are  in  fact  a  few  cases  where  the  reciprocal  form  does 
reduce.  None  the  less,  many  of  the  forms  of  the  irreducible  system  do  in 
fact  appear  in  pairs  which  are  related  in  this  way.  Thus,  for  instance,  the 
form  (A,  S)2  =  0  represents  a  tangential  pencil  of  conies,  which  is  the  polar 
reciprocal  of  the  given  pencil  with  respect  to  w.  It  may  be  verified  that  the 
point  equation  of  the  envelope  (A,  S)2  =  0  is  a  linear  combination  of  (T,  S)1 
and  Hw. 

It  will  also  be  observed  that  a  number  of  forms  appear  in  pairs,  one  of 
which  is  derived  from  the  other  by  replacing  (2/0>2/i)  by  (y0,  —  y±) — again 
ignoring  factors  independent  of  u,  x  and  y.  This  corresponds  to  the  fact  that, 
with  each  conic  S  of  the  pencil  is  associated  another  conic  8'  given  by 
(H,  S)1  =  0,  such  that  the  parameters  of  $,  S'  harmonically  separate  the 
parameters  of  the  mutually  apolar  conies  S0,  S±. 

16.  We  next  give  geometrical  interpretations  of  the  forms  y,  §,  A  and  L. 
If  (u0,  u19  uz)  are  the  coordinates  of  a  line  Z,  then  y  =  0  is  easily  recognized 
as  the  conic  which  is  the  locus  of  poles  of/  with  respect  to  conies  of  the  pencil 
(the  eleven-point  conic  of/),  or,  alternatively,  if  (x0tx19x2)  are  coordinates 
of  a  point  P  then  y  =  0  is  the  tangential  equation  of  the  point  P'  which  is 
conjugate  to  P  with  respect  to  every  conic  of  the  pencil.    The  reciprocal 
form  with  respect  to  w  is  (H ,  A)2  =  0,  and  may  be  interpreted  similarly. 

The  interpretation  of  the  form  Q  is  a  little  less  simple.  If  (x0,  xl7  x2)  are 
coordinates  of  a  point  P  and  (?/0,  y±)  are  parameters  of  a  conic  8  of  the  pencil, 
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then  Q  =  0  is  the  tangential  equation  of  a  point  P*  such  that  the  point  Pf 
conjugate  to  P  with  respect  to  the  pencil  of  conies  defined  by  S0  and  St  is 
conjugate  to  P*  with  respect  to  the  pencil  of  conies  defined  by  $  and  w. 

The  forms  A  and  L  may  be  interpreted  as  follows.  If  (uQtul9uz)  are  the 
coordinates  of  a  line  I  and  (y0>2/i)  are  parameters  of  a  conic  S  of  the  pencil, 
then  A  =  0  is  the  equation  of  the  tangent  to  the  eleven-point  conic  of  I 
with  respect  to  the  pencil  at  the  point  which  is  the  pole  of  I  with  respect 
to  St  and  L  =  0  is  the  equation  of  the  eleven-point  conic  of  I  with  respect 
to  the  pencil  defined  by  S  and  w. 

17.  We  conclude  by  pointing  out  that  the  conditions  which  ensure  that 
the  pencil  of  ternary  quadratics  belongs  to  a  given  one  of  the  known  types 
can  all  be  expressed  in  terms  of  the  vanishing  of  forms  of  the  complete 
system.  The  actual  conditions  can,  of  course,  be  stated  in  many  cases  in 
a  variety  of  ways.  The  classification  we  adopt  here  is  based  on  the  following 
considerations. 

First  consider  the  non-singular  case  in  which  A  =£  0.  Then,  if  D  =j=  0  we 
have  the  general  type  of  pencil,  in  which  all  the  base  points  are  distinct. 
If  D  =  0  and  H  4=  0  then  there  are  two  distinct  singular  conies  in  the  pencil. 
Two  ca-ses  arise,  according  as  the  double  root  of  the  cubic  A  =  0  represents 
a  line-pair  or  a  repeated  line.  In  the  former  case,  which  is  the  general  one, 
the  conies  have  simple  contact;  in  the  latter  they  have  double  contact.  In 
this  case  the  form  j  vanishes  identically  (and  so  also  does  £).  If  H  ==  0  the 
three  roots  of  A  =  0  are  coincident,  and  the  conies  have  three  point  or 
four  point  contact  according  as  the  singular  conic  in  the  pencil  is  a  line-pair 
or  a  repeated  line.  In  the  latter  case  the  forms  j,  £,  w  all  vanish.  Finally, 
the  conies  of  the  pencil  are  all  coincident  if  7  vanishes  identically.  Thus  the 
non-singular  types  can  be  classified  as  follows. 

General  case:  Z)=j=0. 

Simple  contact :  D  =  0,  H  =f=  0,  j  4=  0. 

Double  contact:  j  =  0,  7/  +  0. 

Three-point  contact :  H  =  0,  j^  0,  A  4=  0. 

Four-point  contact:  w  =  0,  A=|=0. 

Conies  coincide:  y  =  0,  A  4=0. 

For  the  singular  pencils,  in  which  A  vanishes  identically,  it  is  easy  to  see  that 
if  the  conies  have  a  fixed  double  point  all  the  forms  vanish  except  $,  S,  y 
and  their  transvectants  with  respect  to  y.  The  two  cases  which  arise  according 
as  the  pencil  of  line  pairs  in  involution  is  general  or  degenerate  are  dis- 
tinguished (as  a  simple  calculation  shows)  by  the  fact  that  the  form  (£,  2)2 
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vanishes  in  the  latter  case  and  not  in  the  former.  Finally,  the  conies  are 
coincident  if  y  —  0.  Thus  the  following  singular  cases  arise.  For  brevity 
we  have  described  them,  by  writing  down  typical  canonical  terms;  the 
nature  of  the  pencil  is  quite  clear  from  any  one  of  these. 

A  =  0,  <2=f=0. 

A  =  0,  Q  =  0,  (S,S)»*0. 

A  =  0,  (2,2)2  =  0, 

A  =  0,  7  =  0,  S=f=0. 


Trinity  College 
Cambridge 
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ON  THE  MULTIPLICATORS  OF  SOME  CLASSES  OF 
FOURIER  TRANSFORMS 

By  RAOXJF  Doss 
[Received  11  December  1943. — Read  16  December  1943] 

Introduction  and  Definitions 

Consider  the  following  classes  of  functions  (real  or  complex)  of  the  real 
variable  x,  which  will  be  supposed  to  be  always  absolutely  integrable  in  the 
sense  of  Lebesgue  in  the  interval  ( —  oo,  +00): 

{1}  Absolutely  integrable  in  the  sense  of  Lebesgue  (without  any  other 

restriction). 

{2}  Bounded  in  the  interval  ( —  oo,  +00). 
{3}  Bounded  in  the  interval  ( —  oo,  +00)  and  integrable  in  the  sense  of 

Riemann  in  every  finite  interval. 
{4}  Uniformly  continuous  in  the  interval  (  —  00,  +00). 
{5}  Of  bounded  variation  in  the  interval  (  —  00,  4-oq)  (we  mean  that  the 

variation  of  the  function  in  every  finite  interval  is  bounded  by  a 

constant  independent  of  the  interval). 
{6}  Absolutely  continuous  in  the  interval  (  —  00,  +00)  and  of  bounded 

variation  in  the  same  interval. 

It  is  clear  that  every  class  of  functions  is  contained  in  the  preceding  one, 
except  for  the  class  {5}  which  is  not  contained  in  {4}  but  in  {3}. 

DEFINITION  1.  A  real  or  complex  function  g(u)  of  the  real  variable  u 
denned  in  (  —  00,  +00)  is  said  to  belong  to  the  class  (Fk)  (k  =  1, 2,  3, 4, 5,  6) 
when  it  is  the  Fourier  transform  of  a  function  of  the  class  {k},  i.e.  if  there 
exists  a  function  f(x )e{&}  such  that 

9W  =  ~ l*J(x)er***dx     (i  =  V(~  !))• 

DEFINITION  2.  The  real  or  complex  function  A(u)  of  the  real  variable  u 
denned  in  ( —  oo,  +00)  will  be  called  a  multiplicator  (j,  k)  (1  ^  j,  k  ^  6)  if  its 
product  by  every  function  of  the  class  (Fj)  is  an  element  of  the  class  (Fk). 
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We  shall  be  concerned  in  this  paper  with  the  following  problems: 

(A)  The  function  g(u)  defined  in  (  —  oo,  +00)  being  given,  to  find  necessary 
and  sufficient  conditions  for  g(u)  to  belong  to  a  given  one  of  the  six  classes  (Fk). 

For  the  class  (Ft)  this  problem  has  been  solved  by  A.  C.  Berry.f  In 
theorem  I  is  given  another  criterion  which  seems  to  be  easier  to  handle  in 
applications.  For  the  five  other  classes  the  conditions  are  given  in  theorems 
II,  III,  IV,  V  and  VI. 

(B)  The  function  A(^)  being  given,  to  find  necessary  and  sufficient  con- 
ditions for  \(u)  to  be  a  multiplicator  (  j,  k)  (j,  k  being  fixed). 

Using  the  characteristic  properties  of  the  functions  g(u)  of  the  classes 
(Fk)  and  with  the  help  of  an  existence  theorem  of  Steinhaus  and  of  an 
analogous  theorem  given  below  (theorem  VII),  and  finally  applying  a  theorem 
of  Verblunsky  and  other  known  results  grouped  in  §  1  ,  I  give  the  solution 
of  this  problem  of  multiplicators  in  thirty-five  cases.  The  conditions  ob- 
tained are  of  a  very  simple  form.  They  express  the  fact  that  one  or  other 
of  the  three  functions  i\(u)  \(u) 

- 


with  i  —  A/(  —  1),  belongs  to  one  of  the  six  classes  (Fk). 


{1} 


{2} 


{3} 


{4} 


{5} 


{6} 


{1} 


{2} 


{3} 


{4} 


{6} 


We  group  them  in  the  above  table,  where,  to  find  the  condition 
characterizing  multiplicators  (1,5)  for  example,  we  must  take  the  square 
corresponding  to  the  first  row  and  fifth  column.  The  condition  is 


•}•  See  also  the  footnote  to  theorem  II. 
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Let  us  mention  that  the  two  analogous  problems  for  the  trigonometric 
series  of  periodic  functions  have  been  studied  partially  by  W.  H.  Young,  f 
H.  Steinhaus,J  S.  Sidon,§  A.  Zygmund||  and  S.  Bochner,^]  and  more  fully 
by  M.  Fekete  f  j*  and  S.  Verblunsky.  J  J  The  influence  of  the  papers  of  the  last 
two  authors  on  the  methods  of  the  present  work  will  be  apparent  to  the 
reader. 

The  problem  of  multiplicators  for  Fourier  transforms  was  proposed  to 
me  by  Professor  Fekete,  of  the  Hebrew  University  of  Jerusalem.  His 
suggestions  and  criticisms  were  invaluable  in  shaping  my  ideas.  I  owe  to 
him,  in  particular,  the  statement  of  theorem  II,  which  has  proved  exceed- 
ingly helpful.  For  his  kind  assistance  I  should  like  to  express  my  most 
profound  gratitude. 

1.    Some  known  results 
We  shall  make  use  of  the  following  known  results: 

1.  For  every  a  >  0  the  function  J[l  —  |  u  \/a]9  where 

if 

a  -  0  if     |  u\  >a, 

is  the  Fourier  transform  of  the  function  (1  —  cos  ax) /ax2,  absolutely  integrable 
and  uniformly  continuous  in  the  interval  ( —  oo,  4-  oo). 

2.  Ifg(u)  €  (FI),  g(u)  is  continuous  and  bounded  in  the  interval  ( —  oo,  +  oo).§§ 


f  W.  H.  Young,  "On  a  condition  that  a  trigonometrical  series  should  have  a  certain 
form",  Proc.  Royal  Soc.  88  (1913),  569-574.  Also  "On  Fourier  series  and  functions  of 
bounded  variation",  ibid.  88  (1913),  561-568. 

|  H.  Steinhaus,  "Additive  und stetige Funktionaloperationen ",  Math.  Zeitschrift, 5 
(1919),  186-221. 

§  S.  Sidon,  "  Reihentheoretische  Satze  und  ihre  Anwendungen  in  der  Theorie  der 
Fouriersche  Reihen",  Math.  Zeitschrift,  10  (1921),  121-127. 

11  A.  Zygmund,  "Remarque  sur  un  theorerne  de  M.  Fekete",  Bull.  Acad.  Polonaise 
Sci.  Lett.  (1927),  343-347 

T[  S.  Bochner,  "t^ber  Faktorenfolgen  fur  Fouriersche  Reihen",  Acta  Univ.  Szeged, 
4  (1929),  125-129. 

tt  M.  Fekete,  "t)ber  Faktorenfolgen  welche  die  Klasse  einer  Fouriersche  Reihe 
unverandert  lassen"",  Acta  Univ.  Szeged,  1  (1923),  148-166.  See  also  two  notes  in 
Comptes  Rendus  190  (1930),  1486  and  193  (1931),  16. 

•$.$  S.  Verblunsky,  "On  some  classes  of  Fourier  series",  Proc.  London  Math.  Soc. 
(2),  33  (1932),  287-327. 

§§  See,  for  example,  S.  Bochner,  Vorlesungen  iiber  Fouriersche  Integrate  (Leipzig, 
1932),  46;  quoted  in  the  sequel  as  Bochner. 
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3.    Theorem  of  G.  H.  Hardy.  ,f   Let  g(u)  be  the  Fourier  transform  of  the 
function  f(x)z  {!}.  Then  the  sequence  of  functions  (n  =  1,2,...) 


converges  to  \{f(x  +  0)  +/(o?  —  0)}  whenever  this  last  expression  has  a  meaning, 
and  converges  tof(x)  almost  everywhere. 

4.    Letf(x)e{l}andlet 

g     (,=  1,2,...); 


lim  f  +°°|  /(a?)  -/„(#)  |  cfo  =  0. 

v—>ooj  —co 


5.    Le£  /(#)e{l}  am?  £e£  ite  Fourier  transform  g(u)  be  absolutely  integrable 
in  the  interval  (  —  oo,  +00).  Then  we  have  almost  everywhere 


==  f+C°i7(M)e«ttaJdit, 

J   -00 


and  the  equality  holds  everywhere  iff(x)  is  continuous.  § 

6.    Let  g^(u)  and  g^(u)  respectively  be  the  Fourier  transforms  o//1(a:)e{l} 
andf2(x)e{l}.  Then\\ 


f+ 

J    -0 


7.  Le£  g^u)  and  g2(u)  respectively  be  the  Fourier  transforms  of  f^x)^^} 
and  fz(x)e{I}.  Then  g±(u)  gz(u)  is  the  Fourier  transform  of  the  absolutely 
integrable  function 


8.    Theorem  of  Berry.  |t  A  necessary  and  sufficient  condition  that  \(u)  e  ( 
is  that 

(1)  A(ifc)  be  bounded  and  measurable  in  (  —  00,  +00), 

(2)  there  exists  a  constant  C  such  that 


U*"A(«)/(*)*» 
for  every  f(x)  e{l}  whose  Fourier  tran^ 

^  O     sup      |  g(u)  \ 

—  00  <M<  00 

sform  is  g(u), 

•j*  See,  for  example,  Titchmarsh,  Introduction  to  the  Theory  of  Fourier  Integrals 
(Oxford,  1937),  29. 

J  Titchmarsh,  loc.  cit.  p.  35.  §  Bochner,  loc.  cit.  p.  51. 

||  Titchmarsh,  loc.  cit.  p.  54.  ^f  Bochner,  loc.  cit.  p.  46. 

tf  A.  C.  Berry,  "Necessary  and  sufficient  conditions  in  the  theory  of  Fourier  trans- 
forms", Annals  of  Math.  32  (1932),  830-838. 
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(3)  to  every  e  >  0  we  can  make  correspond  a  $  >  0  such  that 


\(u)f(u)du 


<esup  \g(u)  | 


for  every  function  /(#)e{l}  whose  Fourier  transform  g(u)  is  summable  in 
( —  oo,  +  oo)  and  verifies  the  condition 


9.    The  function  i/(u~i)  is  the  Fourier  transform  of  the  function  k(x), 
absolutely  integrable  and  of  bounded  variation  in  (  —  oo,  +00),  where 


f 

=  \ 


-  2ne-x    if 

o       if 


10.    Theorem  of  Bochner.  ^  The  relation  g(u)  e  (F$)  implies  the  two  relations 
g(u)  €  (jFi)  and  iug(u)  e  (I\).  Conversely,  the  last  two  relations  imply  the  first  one. 

2.    On  the  conditions  that  a  function  g(u)  should  be  a  Fourier  transform 


THEOREM  I.   A  necessary  and  sufficient  condition  that  A(tt)€(.#i)  is  that 
(^)[l  —  |  u  \/n]  be  continuous  for  every  positive  integer  n  and  that  the  functions 


J  - 


satisfy  the  two  conditions: 

(1)  There  exists  a  constant  A  such  that 

I  &n  (x)  \dx^A    for  every  n. 

(2)  To  every  e  >  0  we  can  make  correspond  a  8  >  0  such  that  for  every  set  E 
whose  measure  is  less  than  d 

|  orn(x)  |  dx  ^  e    for  every  n. 

Proof.  Necessity.  Let  l(x)t{\}  be  the  generatrix  of  X(u).  Since  A(w)  is 
continuous  (result  2),  the  function  A(w)[l  —  |  u  \/ri]  is  also  continuous. 


"f"  Bochner,  loc.  cit.  p.  89. 

J  Bochner,  loc.  cit.  pp.  92-93.  We  verify  at  once  that  the  statement  of  the  text  is 
equivalent  to  the  statement  given  by  Bochner  in  his  book  if  we  observe  that  a  necessary 
and  sufficient  condition  that  a  function  <f>(x)  be  an  indefinite  integral  of  some  function 
of  the  class  {1}  is  that  <p(x)  be  absolutely  continuous  and  be  of  bounded  variation  over 

(-00,  +00). 
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We  have,  on  the  other  hand  (result  3), 
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O"n(#)    =   ~ 


Therefore 

/•  .LOO 

I  I  / 

I  O"~[, 

J-J      ^ 


dx 


/•  + 

| 

J  —  oo 


Finally,  since  the  indefinite  integral  of  an  absolutely  integrable  function  is 
an  absolutely  continuous  function  of  a  set,  to  every  e  >  0  we  can  make 
correspond  a  d  >  0  such  that  for  every  set  E  whose  measure  is  less  than  S 


_TT,  f  ,      ,  v  i  _,        1  f  +°°  1  —  cosn£  Jr  f 

Whence  kn(a?)   flte<~          —  ^^d^\ 

J  E  ffj  -co        nc,  j 


Sufficiency.  We  shall  prove  that  if  the  conditions  of  theorem  I  are  satisfied, 
then  the  conditions  of  Berry's  theorem  are  satisfied  too. 

(1)  Since  A(^)[l  —  |  u  \/n]  is  summable  in  (  —  00,4-00),  and  since  its 
Fourier  transform  (l/27r)cr/l(  —  #)e{l},  we  have,  by  result  5,  everywhere, 
since  A.(u)  [1  —  |  u  \jn]  is  continuous, 


This  inequality  being  true  for  every  positive  integer  n  gives 


. 

(2)  Let/(#)  be  an  arbitrary  function  of  the  class  {1}  and  let  g(u)  be  its 
Fourier  transform.  As  we  have  just  seen,  the  function  A(^)[l  —  |  u  \/n]  is 
the  Fourier  transform  of  the  absolutely  integrable  function  crn(x).  We  have, 
therefore,  by  result  6, 


L      sup 

—  OCX  M  <  CO 


\g(u)\. 
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When  n->ao,  the  absolutely  integrable  functions  A(t*)[l  —  |  u  \/n]f(u)  tend 
to  A.(u)f(u),  being  less  in  modulus  than  the  absolutely  integrable  function 

(1/277)4  | /M  | .  Therefore 


I  /»+oo  i  /•+<»      p     i  •?/  n 

\(u)  f(u)  du  =  lim          X(u)    1  -  J—1 

U-oo  w.->.oo|J-oo  L  ^    J 


(3)  Let  f(x)  €  {1}  be  any  function  whose  Fourier  transform  g(u)  verifies 

!*+co 
\g( 

J   -00 

We  know  already  that 


r  +  m  /•  +  » 

I       X(u)f(u)du   =  lim    I       g(u)crn(u)du 

J  —  oo  n->oo    J  —  oo 


Write 
We  have 


r 

J  -a 


g(u)<rn(u)du 


I         g(u)(rn(u)du  4-  g(u)o~n(u)du 

\J  Ektn  \J  CEk,n 


J/*  +  oo 
|crw(M)|dM  +  *  I        |?(tt)|cft* 
Efc,n  J  -  QO 

<  SUp  |  gr(lt)  |  (  f        |  orn(«)  |  du+  kfi\  . 
\JEk,n  ) 

Choose  a  fixed  k  such  that  I       |  crn(u)  \  du  ^  %e  for  every  n.  This  is  possible 

J  Ek,n 

by  the  conditions  of  our  theorem  since  |  EktU  \  ^A/k.  Then  choose  d  >  0  such 
that  kd  <  Je.  This  gives 


The  three  conditions  of  Berry's  theorem  are  thus  satisfied  and  the  theorem 
is  proved. 

THEOREM  II.  Necessary  and.  sufficient  conditions  that  X(u)  e  (F2)  are  that 
A(w)[l  —  |  u  \/ri]  is  continuous  for  every  positive  integer  n  and  that  the  functions 


r+c°      r     \u 

crn(x)  =          A(ti)     1-1— 

J-QO  L  n 


x  du 


satisfy  the  two  conditions: 

(I)  There  exists  a  constant  A  such  that 


/*  +  00 

I  crn(x)  |  dx  ^  A    for  every  n. 

J    -00 
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(2)  There  exists  a  constant  B  such  that 

|  <rn(x)  \^B    for  every  n. 

[The  statement  of  this  theorem  was  kindly  communicated  to  me  by 
Professor  Fekete.  I  first  proved  it  with  the  help  of  the  following  extension 
of  a  theorem  of  S.  Banach:  "Let  the  functions  ocn(x)  be  summable  and 
essentially  bounded  in  (  —  oo,  -f  oo)  and  let  them  satisfy  the  two  conditions  : 


/: 


an(x)   dx^A,     ess.  u.b.  |  an(x)  \  <  B, 


where  A  and  B  are  constants  independent  of  n.  Then  we  can  extract  a 
partial  sequence  {anfc(x)}>  and  we  can  find|  a  function  a(x)  e  {2}  such  that 

for  every  /(#)e{l} 

/•+«>  /•  +  «> 

Km         f(x)ank(x)dx  =         f(x)ct(x)dx" 

fc-»00  J    —00  J   —00 

A  direct  proof  of  theorem  II,  independent  both  of  Berry's  result  and  the 
above-mentioned  theorem  on  linear  operations,  has  been  recently  (after 
the  completion  of  this  paper)  communicated  to  me  by  Professor  Fekete, 
who  found  also  a  characterization  of  the  functions  of  the  class  (F^  in  terms 
of  the  (rn(x).  Later  on,  I  found  that  a  statement  given  by  H.  Cramer  J  and 
modified  by  Gonzalez  Dominguez§  seems  to  coincide  with  the  statement 
of  theorem  I]. 

Proof.  Necessity.   We  have  only  to  prove  the  necessity  of  the  last  con- 
dition. Suppose  that  l(x)€{2}  is  the  generatrix  of  \(u).  We  have 


u.b.        Z(*_|)  =      u.b. 


<  co 


Sufficiency.  It  is  evident  that  condition  (2)  of  theorem  II  implies  con- 
dition (2)  of  theorem  I.  We  have  only  to  choose  S  —  e/B.  Therefore  A(w)  is 
already  the  Fourier  transform  of  some  function  l(x)  e{l).  By  Hardy's 
theorem,  we  have,  almost  everywhere, 

|  l(x)  |  =  lim  |  <rn(x)  |  ^  B. 

m->oo 

The  function  l(x)  is  therefore  essentially  bounded.  Whence  follows  the 
existence  of  a  function  l*(x)  e  {2}  equivalent  to  l(x)  and  so  having  the  same 
Fourier  transform  X(u).  This  completes  the  proof. 


t  Compare  S.  Banach,  TMorie  des  Operations  IMaires  (Varsovie,  1932),  130-131. 
j   Trans.  American  Math.  Soc.  46  (1939),  191-201. 

§  Duke  Math.  J.  6  (1940),  246-255.  I  have  been  unable  to  consult  this  paper. 
Cf.  Math.  Reviews  1,  (1940),  226. 
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THEOREM  III.  A  necessary  and  sufficient  condition  that  A(w)  e  (F^)  is  that 
the  conditions  of  theorem  If  be  realized  and  that  the  functions  o~n(x)  be  uni- 
formly M-integrable  in  (  —  oo,  +  oo).f 

Proof.  Necessity.  The  conditions  of  theorem  II  are  of  course  necessary. 
To  prove  the  necessity  of  the  additional  condition  suppose  that  A(^)  be 
the  Fourier  transform  of  o~(x)  €  {3}.  Then 


Let  e  >  0  be  an  arbitrary  number  and  (  —  a,  +  a)  an  arbitrary  finite  interval. 


Let  M  be  the  upper  bound  of  |  a(x]  |  in  ( —  oo,  +  oo).  Since 
we  can  define  a  positive  number  X  such  that  for  every  n  ^ 


-  cos  nE, 
__ 


We  have  also 


i  r+^i^ 

7TJ-X 


(1) 


(2) 


On  the  other  hand,  since  &(x)  is  7?-integrable  in  the  interval  (  —  a  —  X, 
+  a  +  JC),  we  can  find  an 


such  that,  («!,/?!),  ...,  (#„,/#„)  being  any  finite  sequence  of  non-overlapping 
intervals  contained  in  (  —  a,  4-  a)  and  verifying  the  relations  0  <  (fti  —  a^)  <  if 
(i  =  1,2,  ...,  v),  and  yt-,  5t.  being  any  numbers  satisfying  the  conditions 
ai  ^  7i  ^  ^i  ^  &»  we  have  for  every  g  in  (  —  X,  +  X) 


-  g)  -  cr(7i  -  g)}  (>?,  -  a,) 


(3) 


t  A  function  cr(x)  e  {2}  is  said  to  bo  J?-integrable  in  (  —  oo,  -f  oo)  if  it  is  .R-integrable 
over  every  finite  interval  /  =  (  —  a,  +  a)  (a  >  0),  i.e.  if  to  every  e  >  0  and  to  every  interval 
/  we  can  make  correspond  an  ^  =  9/{e,  /}  such  that  (a,lt  /?t), . . .,  («„,/?„)  being  any  finite 
sequence  of  non-overlapping  intervals  contained  in  /  and  satisfying  the  condition 
0<(/?f  —  af)<?7  (i  =  1,2,  ...,»>)  and  7,,  <5f  satisfying  the  condition  a^ < 7^ < dt < /?<f 
w^e  have 


A  sequence  of  functions  (Tn(x)  €.  {2}  is  said  to  be  uniformly  .R-integrable  in  (  —  oo,  4-  oo) 
if  the  numbers  7j{e,  1}  can  be  determined  independently  of  n,  so  that  the  preceding 
inequality  holds  for  every  <rn(x). 

In  a  previous  redaction  of  this  work  I  had  adopted  another  definition  of  J?-inte- 
grability  in  (  —  oo,  +  oo).  Professor  J.  M.  Whittaker,  of  Liverpool,  kindly  pointed  out 
to  me  that  it  was  not  the  usual  one. 
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For  £  outside  this  interval 


But  we  can  write 


. 
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~  £)}  (A  -  «< 


1  -  COS  ft£ 


This  last  relation  shows  by  (1)  and  (4)  and  by  (2)  and  (3)  that 


dg. 


t-i 

This  proves  that  the  functions  crn(x)z{2}  are  uniformly  .R-integrable  in 
(-00,  +00). 

Sufficiency.  If  the  conditions  of  theorem  II  are  satisfied,  A(^)  is  already 
the  Fourier  transform  of  some  function  <r(x)  e{2}.  Let  e  be  any  positive 
number  and  /  =  (-a,  4- a)  (a>0)  any  finite  interval.  With  the  notations 
and  the  assumptions  of  the  preceding  footnote,  we  have,  by  hypothesis, 
for  every  n 


By  Hardy's  theorem,  the  sequence  crn(x)  tends  to  o"(x)  everywhere  except 
perhaps  on  a  set  E  of  measure  zero.  Suppose  the  y^,  St  be  chosen  on  the  set 
ICE.  Going  to  the  limit 


S  { 


«,) 


The  function  <r(x)  is  therefore  essentially  .R-integrable  in  ( —  a,  +  a).  Replace 
it  by  the  equivalent  function  <r*(x)  defined  as  follows  :f 

on  GE:  cr*(x)  =  or(*), 

on  E:  <r*(x)  =  lim  Ra-(£)  +  i  Jim  /cr(£),     where     £eCE. 

t^x  l-+x 

The  function  <r*(jc)  is  /2-integrable  in  (—a,  -ha)  and  its  Fourier  transform 
is  A(i*).  But  since  cr*(x)€.{2},  and  since  the  interval  (—a,  +a)  is  arbitrary, 
we  have  <r*(x)  €  {3}.  Whence  A(w)  €  (F3). 


See  the  first -mentioned  paper  by  Fekete,  Hilfssatz  1. 
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THEOREM  IV.  A  necessary  and  sufficient  condition  that  A(-w)  €  (1^4)  is  that 
the  conditions  of  theorem  II  be  realized  and  that  the  functions  <rn(x)  be  uniformly 
(in  n  and  x)  continuous. 

Proof.  Necessity.  The  conditions  of  theorem  II  are,  of  course,  necessary. 
To  prove  the  necessity  of  the  additional  condition  suppose  that  X(u)  be  the 
Fourier  transform  of  cr(x)  e  {4}.  Then 


By  the  properties  of  cr(#),  to  every  e  >  0  we  can  make  correspond  a  d  >  0  such 
that  the  relation  |  x±  —  x2  \  <  #,  ( —  oo  <  xlt  x2  <  oo)  implies  [  o-(x±)  —  cr(#2)  |  <  e. 
Whence 

c\  c\  I  ^  ~  cosng  ,., 


— 


The  functions  crn(x)  are  thus  uniformly  (in  n  and  x)  continuous. 

Sufficiency.  If  the  conditions  of  theorem  II  are  satisfied,  A(^)  is  already 
the  Fourier  transform  of  some  function  o"(x)  e  {2}.  By  Hardy's  theorem  the 
sequence  o-n(x)  converges  to  cr(x)  everywhere  except  perhaps  on  a  set  E  of 
measure  zero.  By  our  assumptions,  to  every  e  >  0  we  can  make  correspond 
a  £>0  such  that  \x1  —  x2\^3  implies  \o"n(xi)—  o~n(x2)  \  ^e  for  every  n. 
Suppose  X-L,  x2  be  chosen  on  the  set  CE.  Going  to  the  limit  |  o'(x1)  —  cr(x2)  \  ^  e. 
The  function  o"(x)  is  thus  essentially  uniformly  continuous  in  (—00,  4-oo). 
We  conclude  that  there  exists  a  function  o~*(x)  equivalent  to  o~(x)  and  so 
having  the  same  Fourier  transform  A(^),  which  is  uniformly  continuous 
in  (  —  oo,  +  oo).  Whence  A(w)  e  (F4). 

THEOREM  V.  A  necessary  and  sufficient  condition  that  A(u)  €  (F5)  is  that  the 
functions  A(w)[l  —  |  u  \/n]  be  continuous  for  every  positive  integer  n  and  that 
the  functions 


satisfy  the  folloiving  condition  : 

There  exists  a  constant  C  such  that  for  every  n 


J    - 


Proof.  We  shall  use  the  following  lemma:  its  proof  is  exactly  the  same  as 
that  of  theorem  IV,  and  so  I  omit  it. 

M2 
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LEMMA.  A  necessary  and  sufficient  condition  that  A(u)e(F5)  is  tlmt 
X(u)[l—  |  u  \/n]  be  continuous  for  every  positive  integer  n  and  that  the 
functions 

<r»<*)  =  I  +  MA(M)  |~1  -  WleC-dH  (6) 

J  -CO  \_  n    J 

satisfy  the  three  conditions: 


J  — 


(7) 


|  <rn(x)  I  <  JB,     (-oo<z<oo),  (8) 

(9) 


where  A,  B,  VQ  are  constants  independent  of  n  and   V{crn(x),  (  —  oo,  +oc)} 
represents  the  variation  ofcrn(x)  in  the  interval  (  —  00,  +00). 

Observe  first  that,  by  a  theorem  of  Lebesgue,  condition  (9)  may  be  written 

(10) 

where  0-^(#)  is  the  derivative  of  o~n(x). 

We  make  use  of  the  function  [e~&]  defined  as  e~^  if  g  ^  0  and  equal  to  zero 
if  g  <  0.  We  have 

»          r  +  ° 

-^  =  - 

U  —  l  J-o 

Hence  ^ 


o-n(a;)  =  -(^(\(u)  ^  fl  -  1~1J  J+<Vg] 

r+oo/          /»-foo         «/__*T       I  -»/  11 
([e-S]         A(%)^    1-l^J- 

J-ooV  J-co  *L  7iJ 


Sufficiency.  Suppose  that  condition  (5)  is  satisfied.  Then 


sup 

—  oo<  ^  <o 


Similarly 

r+o°i  o^*)  i  <&<  r  +co  r 

J  —oo  J  —  ooj  —c 
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Finally,  differentiating  relation  (6)  with  respect  to  x  (the  limits  of  integration 
are  finite  and  the  derivative  of  the  function  under  the  integral  sign  is 
continuous)  we  get 

<(*)  =  f  +  °°A(^)  iu  [~1  -  W]  e™*  du, 

J  -oo  L  n    J 

whence  rn(x)  =  -orn(x)-cf/n(x). 

This  gives  by  (5)  and  (!') 

°°  (10') 


Conditions  (7),  (8)  and  (10)  of  the  lemma  are  thus  satisfied  and  A(^)  €  (F5). 

Necessity.   Conversely,  suppose  that  conditions  (7),  (8)  and  (10)  of  the 
lemma  are  satisfied.  This  gives 


r'\Tn(x)\dx<  r~\<rn 

J  —  CO  J   —00 

The  proof  is  now  complete. 


THEOREM  VI.  The  two  relations  g(u)  e  (F6)  and  —  ^~~g(u)  e  (FJ  are  equivalent. 

Proof.  Let  g(u)  be  an  arbitrary  function  of  the  class  (  F6).  By  the  theorem 
of  Bochner  we  have  —  g(u)  €  (  I\)  and  —i^(u)€(Fl).  Whence,  by  addition, 


Conversely,  let  '^^y(u)e(Fl).  (11) 

Since  -  :e  (^i),  we  have  (result  7) 
u  —  i 

^lg(u)~.^g(n)^(F1).  (12) 

tf  U  ~~  v 

By  (11)  and  (12)         ^lg(u)  +  g(u)  =  ^iug(u)e(Fl).  (13) 

i 

By  Bochner's  theorem  (12)  and  (13)  imply 


Remark.   We  shall  use  sometimes  the  following  form  of  theorem  VI: 
The  two  relations  .  .  . 

g(u)e(F1)     and     ^e(F6) 

Mr  ~~~  if 

are  equivalent. 
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3.    The  fundamental  theorem 

Before  stating  our  fundamental  theorem,  let  us  recall  a 
Theorem  of  Steinhaus.^  Let  the  functions  o~n(x)  be  measurable  and  essentially 
bounded  in  the  interval  ( —  oo,  +00)  and  let 

lim  sup  (essu.b.  |  crn(x)  \ )  =  oo. 

71— >OO 

Then  there  exists  a  function  f(x)  €{1}  such  that 

I  f  +  0° 
lim  sup          f(x)  crn(x)  dx   =  oo. 

n->oo       |J  —  oo 

THEOREM  VII.    //  the  functions  o"n(x)  are  absolutely  integrable  in  the 
interval  (  —  00,  +00),  and  if 

r  *j~  * 
lim  sup          |  o~n(x)  |  dx  —  oo, 

n->oo      J  —  oo 

then  there  exists  a  function  /(#)«€  {4}  such  that 


lim  sup 


f-J-ool   f  +  00 

f(x- 

J  —  oo  J  J  —  oo 


£,)ffn(i)di 


dx  =  oo. 


The  proof  is  given  in  two  parts. 

I.   If  o"(x)  €{1}  we  shall  show  that,  given  any  positive  rj  <  J,  there  exists 
a  function  f(x)  €  {4}  satisfying  the  three  conditions 


T"  f+"/(*- 

J    —00       J   —00 


(*)!<!,  ra>\f(x)\dx 

J    -00 


Making  use  of  result  4  with  respect  to  the  function  cr(ic),  we  see  that  there 
exists  an  integer  v^2  such  that 


/: 


1  __ 


Hence 


p  ipi- 

J-oo    TTJ-oo 


<fo 


J+oo  i  r+°°i_cosv£  r+ 

-  °0a8VV(a;-g)dgfe>(i-^) 

-oo     TlJ—oo  V^  J— 


f  H.  Steinhaus,  ?oc.  cti.  pp.  219-221.  Steinhaus  has  given  his  theorem  for  finite 
limits,  but  there  is  no  difficulty  in  proving  it  for  infinite  limits. 
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Since  the  integral 


183 


r  +  c 
J  —  co 


L  —  cos  v(x  —  £) 


da; 


exists,  in  virtue  of  the  foregoing  inequality  we  can  choose  (i)^2/^n  so 
large  that 


i-??rq 

^        J-OC 


Hence  if  J57  is  any  interval  or  sum  of  intervals  which  have  no  point  in 
common  with  (—  o>,  - 


T  -,  i.,  v 

Let  A  =  2W  and  take     /(*)  =  -^ 

By  this  choice  of  A  and  0)  for  Q^Ja^v—l  we  have  the  inequality 


1    fw+fcA    I  /•  +  » 

i.L+JJ-»/(*H 


dx 


do; 


I  v-l   /*w+feA     I   r  +  co   I 
^•-oJ-w-HfcAjJ  -oo 


—  cos  v(x  —JA  —  £) 


dx 


Hence 


/»^.oo|  /»-f-oo  v-l  /»w+A;A        /*  +  oo 

/(*-g)«r(Qdg  dz>  S  /(*- 

J_co|J-oo  fc=-0j  -6>  +  fcA    J  -oo 


Now  by  the  above  choice  of  /(#),  y,  A,  w,  we  clearly  have 
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if  x  lies  outside  all  the  intervals  of  diameter  a>  whose  centres  coincide  with 
-~I),  while 

1       124 


as  x  ranges  over  one  of  these  intervals.  Thus  0 </(#)<  1  for  —  oo<x<oo. 
Finally 

r~\f(x)\dx=i. 

J  -co 

This  completes  the  proof  of  part  I. 

/•-}-.  CO 

II.    fLet       I       |  crn(£)  \  d£  =  o)n,     where     lim  supe^  =  oo. 

J  —  oo  n->oo 

By  part  I,  to  every  0*n(g)  we  can  make  correspond  a  function  fn(x)  €  {4} 
such  that 

I  /»(*)  I  <  1,     f +°°|  /„(*)  |  «fa  -  1,     f +W  i  f  *"/»(*-l)«r.l 

J—  00  J    —  00     |J    —  00 

If  any  of  the  functions  fk(x)  satisfies  the  condition 


lim  sup 


f"*" 

J  —as 


dx  =  oo, 


the  theorem  is  proved.  We  may  suppose  therefore  that  for  every  k 


fT 

J   _oo       J    -, 


for  every  w, 


where  the  cfc  are  finite  constants. 

We  shall  show  that  we  can  extract  a  subsequence  /fcp(#)  such  that  the 
function 


verifies  the  conditions  of  our  theorem. 


\  Many  simplifications  in  this  part  of  the  proof  are  due  to  Professor  Fekete  and 
Dr  M.  Schiffer  of  Jerusalem.  Another  short  proof  of  part  II  can  be  given  by  making 
use  of  the  following  theorem  of  Banach  and  Steinhaus  : 

"Let  the  Un(x)  be  a  sequence  of  linear  operations  defined  in  a  Banach  space  E,  and  let 

lim  sup  ||  Un(x)  ||  <  oo 


for  every  x  e  E.  Then  the  sequence  of  norms  ||  Un  \\  is  bounded." 
See  the  above-mentioned  book  by  Banach  (p.  80,  theorem  6). 
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It  is  evident  that  whatever  be  the  subsequence  fkp(x),  f(x)€.{4}.  Now 
for  every  p 


/••foe  I  r  +  oo 

/(*- 

J    —00    |J   —00 


dx 


1    /»  +  » I  r  +  oo 

-Ss  /fcj(* 

i<p  °   J  —oo  |J  —oo 
1     /*  +  <»     p  +  ao 

-Sa  /*,<* 

i>2?  ">  J  —oo     J  —oo 


dx 
dx 


The  subsequence  ki  having  been  chosen  up  to  kp_t  we  can  always  choose, 
owing  to  lim  sup  a)n  —  oo,  a  number  kp  such  that 


This  proves  the  theorem. 


4. 


multiplicators  for  Fourier  transforms 


THEOREM  VIII.  A  necessary  and  sufficient  condition  for  X(u)  to  be  respec- 

tively a  multiplicator  (5,1),  (5,2),  (5,3),  (5,4),  (5,5)  is  that  —  ^-4  belongs 

11  —  "  i/ 

respectively  to  the  classes  (J^i),  (Fz)9  (Fs),  (FJ,  (F6). 


Proof.   Necessity.   Since  --  -. 
— 


(result   9)   the   stated   condition   is 


necessary,  by  definition,  in  every  case. 

Sufficiency.  The  proof  is  very  similar  in  the  five  cases.  We  shall  give  it, 
by  way  of  example,  successively  for  multiplicators  (5,  1),  (5,  5)  and  (5,  3). 


For  multiplicators  (5, 1 ).  Let  — -7  be  the  Fourier  transform  of  an  arbitrary 


function  of  the  class  {!},  and  let 


(14) 


We  have,  by  theorem  I, 

(1)  r<X>\sn(x) 

J   —00 
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(2)  to  every  e  >  0  corresponds  a  S>  0  such  that  for  any  set  E  of  measure 
less  than  8  . 

J  E  ^ 

Let  now  g(u)€(F5)  be  the  Fourier  transform  of  the  arbitrary  function 
/<*)€{*}• 

The  functions  ^—^~~  and  g(u)  are  continuous  (result  2).    The  same  holds 

u  -~  i> 

for  X(u)  and  therefore 

g(u)  X(u)  I  1  —  - — -  'I  is  continuous  for  every  positive  integer  n.     (15) 

Consider  the  two  functions 

r+°°         r       \u\"l 
^"^""J-oD          L  ~    n  ]e 

We  have 

S(X)  =  I  rm{x(u)  [l  -  W~U,*  f+<*f(£)e-**d£\du 
arrj.oo  {         L        n  J        J-oo 


If  we  differentiate  relation  (14)  with  respect  to  x  (in  fact,  the  limits  of 
integration  are  finite  and  the  derivative  of  the  function  under  the  integral 
sign  is  continuous),  we  get 

«;<*)  =  -  r^wli- 
J-«      L 

Whence  <rn(a;)  =  -«„(«)  -*!,(*), 


But  we  can  write 

f  +"/(*-£)< 

J  —oo 

&  -  -  f  *"«»(*- 

J  —  OO 
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for  sn(g)  is  bounded  (result  2)  and  /(#-£)  tends  to  zero  when  £->±oo, 
since  /(£)  is  absolutely  integrable  and  of  bounded  variation  in  (  —  oo,  4-  oo).*f 


Hence       Sn(x)  =  -/(£)«,.(*-0<£  +  *»(*-£)#«>-  <20) 

We  conclude  by  the  property  (1)  of  sn(x) 

+  ~AJ+_Jdm\  ~AK,      (21) 
where  K  is  a  finite  constant  equal  to 


f 

Let  now  e'  be  any  positive  number.  Take  e  =  e'/K,  and  let  8>  0  be  the 
number  corresponding  to  e.  For  every  set  E  whose  measure  is  less  than  S 
we  have 


f   \ 

JE 


Relations  (15),  (17),  (21)  and  (22)  show  now,  by  theorem  I,  that 


This  proves  that  A(w)  is  a  multiplicator  (5,  1). 

For  multiplicatora  (5,5).  Let  —  ^—l€.(F&)  be  the  Fourier  transform  of  an 

u  —  i 

arbitrary  function  of  the  class  {5}.  We  use  the  same  notations  as  before. 
It  is  clear  that  relation  (20)  holds.  Moreover,  by  the  lemma  of  theorem  V 
there  exists  a  constant  B  such  that 

I  sn(x)  I  ^  &    f°r  every  n. 


•{•  To  see,  for  example,  that    lim    /(£)  =  0,  we  proceed  as  follows.  /(£)  belonging 

«-*  +  «  fCD 

to  the  class  {5}  we  can  find  a  number  Xt  such  that  I       I  df(£)  I  <  e,  where  e  is  an 

Jxt 

arbitrary  positive  number.    But  since  /(£)  belongs  to  the  class  {1}  we  can  find  an 
Xt>Xl  auch  that  |/(^2)  \<€<  Whence, 


/(«)=  f* 
J  xt 

and  so  |  f(x)  \  <  2e. 
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We  want  to  show  that  there  exists  a  constant  A  such  that 

0 

I  o~n(x)  |  dx  ^  A     for  every  n.  (31) 

> 

In  fact  if  (31)  does  not  hold,  there  exists,  by  theorem  VII,  f  a  function 
f(x)  e  {4}  such  that 


/»  +  co     |*  + 

Mm  sup  j         J 

n->oo       J  -co    J  — 


dx  =  oo, 


contrary  to  (30).  Therefore  (31)  holds.  By  theorem  V,  (29),  (16')  and  (31) 

,    i\(u\    ,  „  v 
imply  -±-j€(F5). 

c*>  —  ti 

THEOBEM  X.  A  necessary  and  sufficient  condition  that  X(u)  be  a  multi- 
plicator  (1,2)  is  that  /\.(u)e(F2).  The  condition  is  the  same  for  multiplicators 
(\,Z)and  (1,4). 

Proof.  Sufficiency.  We  need  only  show  that  whatever  be  g(u)  e  (F^  and 
\(u)  e  (F2)  then  g(u)  A.(u)  is  the  Fourier  transform  of  some  function  abso- 
lutely integrable  and  uniformly  continuous  in  (—00,  +00). 

Let/(#)c{l}  and  l(x)€.{2}  be  respectively  the  generatrices  of  g(u)  and 
X(u).  By  result  7,  g(u)\(u)  is  the  Fourier  transform  of  the  absolutely 

integrable  function  i    /•  +  « 

—         /(*-g)I(g)dg.  (32) 

2?rJ  _„ 

But          f'l""{/(*  +  »-S)-/(* 

J   -CO 

<     u.b.    |t( 

—  00<^<OO 

-     u.b. 


<oo 


£)  |  f+" 

J  —  cx> 


This  last  integral  tends  to  zero  with  y  by  a  theorem  of  Lebesgue. 

The  function  (32)  is  thus  uniformly  continuous. 

Necessity.  We  need  only  prove  that  if  A(^)  is  a  multiplicator  (1,2)  we 
have  necessarily  X(u)  e  (^2)-  To  do  this,  we  shall  show  that  all  the  require- 
ments of  theorem  II  concerning  the  functions  A(^)[l  —  |  u\/n]  and  crn(x) 
defined  by  (16)  are  satisfied. 


f  Every  &n(x)  certainly  belongs  to  the  class  {!}.  To  see  this,  observe  that  the 
function  A(w)[l  —  j  u  |/w]  is  absolutely  integrable  in  (  —  00,  H-oo).  Moreover,  it  is  the 
Fourier  transform  of  some  function  (r*(x)  €  {!},  since  A(w)  is  a  multiplicator  (4, 1)  and 
since  [1  —  |  u  |/n]£(J?T4)  (result  1).  Hence,  by  result  5, 

<rn(x)  =*  cr*(x) 
almost  everywhere,  which  proves  that  crn(x)  belongs  to  the  class  {!}. 
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Relations  (15),  (17),  (21),  (23)  and  (28)  show,  by  theorem  III,  that 

X(u)g(u)e(F3). 
This  proves  that  A(w)  is  a  multiplicator  (5,  3). 

THEOREM  IX.  A  necessary  and  sufficient  condition  that  X(u)  be  a  multi- 

plicator (1/1)  is  that  —  ^—  r€(^5).  The  condition  is  the  same  for  multiplicators 

u  ~~~  I/ 

(2,1),  (2,  2),  (3,  1),  (3,  2),  (3,  3),  (4,  1),  (4,  2),  (4,  3),  (4,  4). 

Proof.  Sufficiency.  We  need  only  show  that  if  -  —.  I  €  (F5),  then  \(u)  is  a 

u  —  i 

multiplicator  (1,  1),  (2,  2),  (3,  3)  and  (4,  4).  The  proof  is  the  same  in  the  four 
cases.  We  shall  give  it,  by  way  of  example,  for  the  case  (3,  3). 

Let  gB(u)  be  an  arbitrary  function  of  the  class  (F3).  By  theorem  VIII, 

—  ^-9a(u)  is  a  multiplicator  (5,3).   But  since  by  hypothesis  ---  r€(J?J), 

"I  U  —  1 

we  have  */_«          ?'Ar?/^ 


This  proves  that  \(u)  is  a  multiplicator  (3,  3). 

Necessity.  It  will  be  enough  to  show  that  if  X(u)  is  a  multipiicator  (4,  1) 

we  have  necessarily  —  —  ~e  (_F5). 

u  —  i 

In  fact  since  [1  —  |  u  \  /n]  e  (F±)  we  have  by  hypothesis  A(t*)[l  —  |  u  \  /n]  e  (f\) 
which  proves  by  result  2  that  A(^)[l  —  |  u  \fn\  is  continuous  and  hence  that 

—  ~    1  —  -  —  -    is  continuous  (for  every  n).  (29) 

u  —  i  [_         n  J 

Let  g(u)  €  (F4)  be  the  Fourier  transform  of  an  arbitrary  function  /(#)*€  {4}. 
Consider  the  functions  crn(x)  and  Sn(x)  defined  by  (16)  and  (17).  By  hypo- 
thesis g(u)\(u)£(F1).  Whence  by  theorem  I  follows  the  existence  of  a 
constant  A'  (depending  onf(x))  such  that 

rm\Sn(x)\dx^A'.  (30) 

J    -00 

But  we  have        <rn(x)  =  f  +*^^fl  -  ^~]e™*du,  (16') 

nv  '      J-^u-i     ^     L        n  1  ^      ' 

and  we  write  again  relation  (18) 
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We  want  to  show  that  there  exists  a  constant  A  such  that 

0 

I  o~n(x)  |  dx  ^  A     for  every  n.  (31) 

> 

In  fact  if  (31)  does  not  hold,  there  exists,  by  theorem  VII,  f  a  function 
f(x)  e  {4}  such  that 


/»  +  co     |*  + 

Mm  sup  j         J 

n->oo       J  -co    J  — 


dx  =  oo, 


contrary  to  (30).  Therefore  (31)  holds.  By  theorem  V,  (29),  (16')  and  (31) 

,    i\(u\    ,  „  v 
imply  -±-j€(F5). 

c*>  —  ti 

THEOBEM  X.  A  necessary  and  sufficient  condition  that  X(u)  be  a  multi- 
plicator  (1,2)  is  that  /\.(u)e(F2).  The  condition  is  the  same  for  multiplicators 
(\,Z)and  (1,4). 

Proof.  Sufficiency.  We  need  only  show  that  whatever  be  g(u)  e  (F^  and 
\(u)  e  (F2)  then  g(u)  A.(u)  is  the  Fourier  transform  of  some  function  abso- 
lutely integrable  and  uniformly  continuous  in  (—00,  +00). 

Let/(#)c{l}  and  l(x)€.{2}  be  respectively  the  generatrices  of  g(u)  and 
X(u).  By  result  7,  g(u)\(u)  is  the  Fourier  transform  of  the  absolutely 

integrable  function  i    /•  +  « 

—         /(*-g)I(g)dg.  (32) 

2?rJ  _„ 

But          f'l""{/(*  +  »-S)-/(* 

J   -CO 

<     u.b.    |t( 

—  00<^<OO 

-     u.b. 


<oo 


£)  |  f+" 

J  —  cx> 


This  last  integral  tends  to  zero  with  y  by  a  theorem  of  Lebesgue. 

The  function  (32)  is  thus  uniformly  continuous. 

Necessity.  We  need  only  prove  that  if  A(^)  is  a  multiplicator  (1,2)  we 
have  necessarily  X(u)  e  (^2)-  To  do  this,  we  shall  show  that  all  the  require- 
ments of  theorem  II  concerning  the  functions  A(^)[l  —  |  u\/n]  and  crn(x) 
defined  by  (16)  are  satisfied. 


f  Every  &n(x)  certainly  belongs  to  the  class  {!}.  To  see  this,  observe  that  the 
function  A(w)[l  —  j  u  |/w]  is  absolutely  integrable  in  (  —  00,  H-oo).  Moreover,  it  is  the 
Fourier  transform  of  some  function  (r*(x)  €  {!},  since  A(w)  is  a  multiplicator  (4, 1)  and 
since  [1  —  |  u  |/n]£(J?T4)  (result  1).  Hence,  by  result  5, 

<rn(x)  =*  cr*(x) 
almost  everywhere,  which  proves  that  crn(x)  belongs  to  the  class  {!}. 
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In  fact,  since  \(u)  is  already  a  multiplicator  (2,  2)  we  have  by  the  preceding 

theorem  — ^-4  €  (FR),  which  proves  that  — ^-4  is  continuous  and  so  that 
u—^    v  5  r  u  —  * 

\(u)  is  continuous  in  (  —  00,  -foo).  (33) 

Also,  by  theorem  V,  there  exists  a  constant  A  such  that 

0 

I  o"n(x)  |  dx  <  A     for  every  n.  (34) 


Let  now  g(u)  be  the  Fourier  transform  of  the  arbitrary  function  f(x) 
We  have,  as  in  (17),  (18), 


By  hypothesis  g(u)  A(^)e(J^2).  By  theorem  II,  the  functions  Sn(x)  are 
uniformly  bounded  in  n  and  x.  In  particular,  they  are  uniformly  bounded 
at  the  point  x  =  0.  We  conclude  that  there  exists  a  constant  B  such  that 

|  <rn(x)  |  <  B    for  every  n  and  x,  (35) 

for,  otherwise,  we  could  find,  according  to  the  theorem  of  Steinhaus,  a 
function  f(x)  €.  {1}  for  which  the  sequence  $^(0)  is  not  bounded.  By  theorem 
II,  relations  (33),  (34)  and  (35)  imply  A(^)e(.F2). 

Let  us  now  recall  a  theorem  of  Verblunsky.t 

//  in  the  finite  interval  (a,  6),  (a<b),  the  functions  <x,n(t)  are  bounded  and  if 


lim  r/(t)an(t)dt 

n-*-oo  J  a 


exists  for  every  f(t)  bounded  and  R-integrable  in  (a,  6),  then  the  sequence  of 
functions  ~x 

an(t)dt 
Jo 

tends  to  a  function  absolutely  continuous  in  (a,  6). 

THEOREM  XI.  A  necessary  and  sufficient  condition  that  \(u)  be  a  multi- 
plicator (2,4)  is  that  A(w)€(Ji'i).  The  condition  is  the  same  for  multiplicators 
(3,4). 

Proof.  Sufficiency.  We  must  show  that  the  relations  A(^)€(^i)  and 
<72(w)  e  (j^)  imply  A(u)g^(u)e.(F4).  But  this  is  an  immediate  consequence 
of  theorem  X. 

f  S.  Verblunsky,  loc.  cit.  theorem  I. 
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Necessity.  It  will  be  enough  to  prove  that  if  \(u)  is  a  multiplicator  (3,  4) 
then  necessarily  A(t*)e(^)  or  what  comes  to  the  same,  by  theorem  VI, 


Since  X(u)  is  a  multiplicator  (3,  3),  we  have  already,  by  theorem  IX, 

.€  (F5).  Let  then  s(x)  e  {5}  be  the  generatrix  of  —  M  ,  and  put 

u  •—  t>  u  —  -  i 

8*(x)  =  K*(a?  +  0)  +  *(a?-0)}. 

Evidently  s*  (x)  =  s(x)  almost  everywhere.  Thus3*(#)  yields  another  genera- 
trix for  —  ^-T  .  We  shall  prove  that  s*(x)  €  {6}. 

I.  We  shall  first  show  that  s*(x)  is  absolutely  continuous  in  every  finite 
interval  (  —  a,  a),  (a  >  0). 

g(u)  being  the  Fourier  transform  of  the  arbitrary  function  /(#)e{3}, 
consider  again  the  functions  sn(x),  crn(x)  and  8n(x)  defined  in  formulae  (14), 
(16)  and  (17).  We  have,  by  (19)  for  every  x  in  (  —  «,«), 


\*<rn(t)dt  -  -  \Xsn(t)dt-  \Xs'n(t)dt. 
Jo  Jo  Jo 


But  for  each  n  >  0  the  function  sn(x)  is  continuous  in  ( —  a,  a)  together  with 
its  derivative  s'n(x).  We  conclude 


Hence  sn(x)  -  an(0)  -  fX(*)  *  "  f %i(0  ^- 

Jo  Jo 

We  shall  show  that 

rx  fx 

lim  5W(^)  =  lim  sn(0)  -  lim       sn(t)dt-~  lim       <rn(t)dt, 

n->oo  w— >oo  n— >oojo  n->oojo 

where  the  hmit  of  each  term  of  the  second  member  exists  and  is  absolutely 
continuous  in  (  —  a, a).  Thus  the  function 

s*(x)  =  4{s(#  +0)  +  S(x-0)} 

will  have  the  same  property,  since,  by  Hardy's  theorem, 
lim  8n(x)  =  %{s(x  +  0)  +  » (a;  -  0)}. 

n.->oo 

We  first  have  lim  «n(0)  =  J{«(  +  0)  +  *(  -  0)}, 

n->oo 

which  is  a  constant. 
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On  the  other  hand,  sn(t)  tends  to  s(t)  almost  everywhere  and  it  remains 


bounded,  by  theorem  II,  since  —±~4z(F5)  C  (Fz).  Hence 

rx  rx 

lim       sn(t)dt  =       s(t}dt, 

n-*°ojo  Jo 

and  the  second  member  is  an  absolutely  continuous  function.   Finally, 

I   crn(t)dt  tends  also  to  a  function,  absolutely  continuous  in  ( -a, a). 
Jo 

To  see  this,  consider  the  subclass  {3'}  C  {3}  of  all  functions  bounded, 
JF?-integrable  in  (-a, a)  and  vanishing  outside  this  interval.  Let  f(x) e {3'} 
and  let  g(u)  be  its  Fourier  transform.  We  have  as  usual 

i  c+oo  ^      ^  i  f+« 

™  '~27rJ_oo  n        b  2nj-a 

By  hypothesis  g(u)  X(u)  €  (F4),  so  that,  by  Hardy's  theorem,  the  limit 

r+a  r+a 

n— >oo  n->ooj— a  n— >°oj  — a 

exists  for  every /(g)  e  {3'}.  We  conclude  from  the  theorem  of  Verblunsky  that 
lim   I   (Tn(t)  dt  exists  and  is  a  function  absolutely  continuous  in  ( —  a,  a). 

n-»-oo  J  0 

II.    We  shall  prove  now  that  s*(x)  is  absolutely  continuous  in  ( -  oo,  +  oo.) 

Let  e  be  an  arbitrary  positive  number.  Since  s*(x)  together  with  s(x)  is 

of  bounded  variation  in  ( —  oo,  +00),  and  since  the  variation  of  a  function  is 

an  additive  function  of  an  interval,  we  can  find  a  finite  interval  ( -  a,  a), 

(a  >  0),  such  that 


V{8*(x),(-<X),  -a)}+V{s*(x),(  +  a,  +co)}^e.  (36) 

On  the  other  hand,  since  s*  (x)  is  absolutely  continuous  in  (  -  a,  a),  we  can  find 
a  number  y  >  0  such  that, 


being  any  finite  sequence  of  non-overlapping  intervals  contained  in  (  —  a,  a), 
the  relation 


implies  £  |  «*(&)  -  **(«<)  I  <  ie- 

SEE.  2.     VOL.  50.     NO.  2391. 
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Let  now  (y^^),  •••>(ym>£m)  be  any  finite  sequence  of  non-overlapping 
intervals  whose  total  length  is  less  than  rj.  By  dividing,  if  necessary,  each 
of  the  two  intervals  containing  the  points  —a  and  -fa,  we  can  subdivide 
our  intervals  in  three  groups  contained  respectively  in  (  -  oo,  —  a),  (  —  a,  -fa), 
(  -fa,  +00).  We  have  then  by  (36)  and  (37) 


This  shows  that  s*(x)  is  absolutely  continuous  in  (  —  oo,  +00)  and  completes 
the  proof. 

THEOREM  XII.  A  necessary  and  sufficient  condition  that  X(u)  be  a  multi- 
plicator  (1,  5)  is  that  h(u)€(F5).  The  condition  is  the  same  for  multiplicators 
(2,  5),  (3,  5),  (4,5). 

Proof.  Sufficiency.  We  need  only  show  that  if  \(u)€(F5)  then  A(w)  is  a, 
multiplicator  (1,5). 

Take  an  arbitrary  function  gL(u)e(FL).  By  theorem  VIII,  — 

i 

multiplicator  (6,  1).  Whence  A(W)^I(M)  —  r—  e  (F^).  By  theorem  VI, 


This  proves  that  A(w)  is  a  multiplicator  (1,5). 

Necessity.  It  will  be  sufficient  to  show  that  if  A(^)  is  a  multiplicator  (4,  5), 
then  necessarily  A.(u)e(F5). 

I.  We  shall  first  show  that  if  A(^)  is  a  multiplicator  (4,  5)  it  is  also  a 
multiplicator  (1,5). 

Take  an  arbitrary  gl(u)e(Fl).   By  theorem  VIII,  —  .—  g^u)  is  a  multi- 

plicator (5,  1).  Therefore  the  function  A(^)  —  r~  g^u)  is  a  multiplicator  (4,  1). 

By  theorem  IX,  A.(u)gl(u)£(F5),  which  shows  that  X(u)  is  a  multiplicator 
(1,5). 

II.  Let  now  A(&)  be  a  multiplicator  (4,  5)  and  so,  by  I,  a  multiplicator 

(1,5).  Take  an  arbitrary  function  g^(u)€.(F^).  By  theorem  VIII,  —  r—  g^u) 

i 

is  a  multiplicator  (5,4).  Therefore  the  function  ^(u^—^-g^u)  is  a  multi- 


plicator (1,4).    By  theorem  X,  \(u)—r-g^(u)€.(F2)9  which  proves  that 
is  a  multiplicator  (4,  2).  Whence,  by  theorem  IX,  A 
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THEOREM  XIII.  The  conditions  characterizing  multiplicators  (j,  6) 
(j  —  1,2,3,4,5,6)  are  those  characterizing  multiplicators  (j,  1)  in  which 

--  r  is  replaced  by  X.(u).  The  conditions  characterizing  multiplicators  (6,  k) 
u  —  i/ 

(k  —  1,  2,  3,  4,  5,  6)  are  those  characterizing  multiplicators  (1,  k)  in  which  A(w) 

,       ,  ,     i\(u) 
is  replaced  by  —  —4. 
*  u  —  i 

Proof.  It  is  an  immediate  consequence  of  theorem  VI.  We  shall  give  it, 
for  example,  for  multiplicators  (6,  k). 

Let  A(w)  be  a  multiplicator  (6,  k).  Take  an  arbitrary  function  g^(u)€.  (F±). 
We  have 

)  e  (^T6),     gi(u)  -—.  A(w)  e  (Fk), 
— 


which  proves  that  —  ^-4  is  a  multiplicator  (1,  k). 

Conversely,  let  —  ^-4  be  a  multiplicator  (I,k).  Take  an  arbitrary  function 
gQ(u)z(FQ).  We  have 


which  proves  that  /\.(u)  is  a  multiplicator  (6,  k). 

Farouk  I  University 
Alexandria,  Egypt 
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RATIONAL  SOLUTIONS  OF  THE  MATRIX  EQUATION  XA=BX 

By  H.  O.  FOULKES 
[Received  19  July  1944  —  Read  16  November  1944] 

1  .    Introduction 

The  problem  of  determining  the  commutant  of  two  matrices  A  and  B 
has  been  discussed  by  many  writers.*  It  is  well  known  that  a  necessary  and 
sufficient  condition  for  XA  =  BX  to  have  a  non-singular  solution  is  that 
A  —  XI  and  B  —  A/  should  have  the  same  invariant  factors,  but  we  are  not 
necessarily  restricted  to  such  cases  in  this  paper.  If  the  n  x  n  matrices  A 
and  B  have  reduced  characteristic  functions  a  (A)  and  b  (A)  respectively, 
then  polynomials  /(A)  and  g(X)  exist  such  that 


where  A(A)  is  the  greatest  common  divisor  of  a(A)  and  6(A).  Hence 
Xh(A)  =  Xa(A)f(A)  +  Xb(A)g(A). 

Clearly,  a(A)  is  zero,  and  if  X  is  an  n  x  n  matrix  satisfying  XA  =  BX,  then 
XAr  =  BrX  for  r  any  positive  integer  and  so  Xb(A)  =  b(B)X,  which  is 
also  zero  since  b(B)  =  0.  Hencef  Xh(A)  =  0  and  similarly  h(B)X  —  0. 
In  the  extreme  case,  when  a(A)  and  6(A)  are  relatively  prime,  A(A)  and  h(A) 
are  respectively  unity  and  the  unit  n  x  n  matrix,  so  that  X  is  necessarily 
the  null  matrix.  Otherwise,  if  A  and  B  have  one  or  more  latent  roots  in 
common,  then  X  can  be  non-zero,  and  it  has  been  proved  J  that  the  number 
of  linearly  independent  solutions  of  XA  =  BX  is  Se#,  where  e^  is  the  degree 
of  the  greatest  common  divisor  of  the  invariant  factor  pi  of  A  —  A/  and  the 


*  C.  E.  Cullis,  Matrices  and  Determinoids,  3,  part  I  (Cambridge,  1926),  Chaps, 
xxvn,  xxvui.  H.  W.  Turnbull  and  A.  C.  Aitken,  An  introduction  to  the  theory  of 
canonical  matrices  (London,  1932),  Chap.  x.  C.  C.  MacDuffee,  The  theory  of  matrices 
(Berlin,  1933),  Chap.  vm.  The  last-named  author  gives  many  other  references. 

t  The  converse  is  not  necessarily  true;  if  Xh(A)  =  h(B)X  =  0,  it  does  not  follow 
that  XA  =  BX. 

}  C.  C.  MacDuffee,  loc.  cit.,  90-92,  ascribes  the  theorem  to  F.  Cecioni  (1909),  and 
G.  Frobenius  (1910). 
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invariant  factor  q3-  of  B  —  A/.  In  determining  the  number  of  solutions,  an 
invariant  factor  repeated  r  times  is  to  be  regarded  as  r  invariant  factors. 

In  this  paper  we  suppose  A  and  B  are  given  n  x  n  matrices  with  elements 
rational  numbers,  so  that  their  characteristic  equations  and  invariant 
factors  can  be  determined.  We  give  a  method  of  obtaining  sets  of  rational 
matrices  satisfying  XA  =  BX,  this  method  giving  the  exact  number  of 
solutions  prescribed  by  the  Cecioni-Frobenius  theorem. 

When  A  and  B  have  some  latent  roots  in  common,  as  they  must  have  if 
X  is  to  be  non-zero,  algebraic  equations  can  be  written  down  which  are 
satisfied  by  both  A  and  B.  Such  an  equation  will  contain  the  reduced 
characteristic  equations  of  A  and  B  as  factors,  but  need  not  be  their  product. 
In  §  6  a  method  of  obtaining  sets  of  rational  solutions  of  XA  —  BX  is  given, 
the  solutions  being  written  down  in  terms  of  the  elements  of  A  and  B  and 
the  coefficients  of  the  particular  algebraic  equation  chosen.  The  method  is 
more  limited  than  that  of  the  earlier  part  of  the  paper,  in  that  the  solutions 
obtained  are  not  all  linearly  independent,  and  the  sets  may  not  be  complete. 

I  am  indebted  to  Mr  D.  E.  Littlewood  of  University  College,  Swansea, 
for  reading  the  manuscript  and  for  very  helpful  criticism. 

2.    Outline  of  method 

If  C  is  a  square  ^-rowed  matrix  and  it  is  possible  to  find  non-zero  matrices 
(p  and  ^  such  that  <f>A  =  G<f>  and  Bi/r  =  i/rC,  then 


Hence  i/r<f>  will  be  a  solution  of  XA  =  BX.  We  will  choose  C  appropriate  to 
the  irreducible  factors  (or  the  powers  of  the  irreducible  factors)  of  the 
greatest  common  divisors  d^  of  the  invariant  factors  pi  of  A  —  XI  and  q^  of 
B  —  A/,  and  by  making  C  rational  we  obtain  rational  matrices  <p,  i/r  and  ty<j). 
We  show  that  every  irreducible  factor  (or  power  of  an  irreducible  factor)  of 
each  d^  gives  rise  to  a  number  of  linearly  independent  rational  solutions 
ijf(j)  equal  to  the  degree  of  the  factor. 

Throughout  the  paper,  questions  of  rationality,  reducibility,  and  linear 
dependence  are  to  be  understood  to  be  relative  to  the  field  of  the  elements 
of  A  and  B. 

3.    Solutions  when  d^  is  a  product  of  rational,  unrepeated,  linear  factors 

When  the  two  invariant  factors  pt  and  q^  have  a  common  rational  linear 
factor  A  —  a,  we  may  take  C  to  be  the  ^-rowed  square  matrix  with  a  in  the 
first  place  on  the  leading  diagonal  and  zeroes  elsewhere.  Then  <f>A  =  C(/> 
is  satisfied  by  a  square  w-rowed  matrix  <j>  which  has  all  its  rows  zero  except 
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the  first,  Y19  which  must  satisfy  Y±(A  —  a/)  =  0.  Also  Bft  =  ftC  is  satisfied 
by  a  square  rc-rowed  matrix  ft  which  has  all  its  columns  zero  except  the 
first,  Zlt  which  must  satisfy  (B  —  ocI)ZI  =  0.  Then  Y±  and  Z-^  are  respec- 
tively a  left  pole  of  A  and  a  right  pole  of  B  corresponding  to  the  common 
latent  root  a,  and  we  have  Y1A  =  aYl  and  BZ1  =  aZlf  giving 


thus  giving  ZiiYl  as  a  solution  of  XA  —  BX.  Such  a  solution  is  a  product  of 
a  column  and  a  row  vector  and  is  thus  a  square  matrix  of  unit  rank. 

By  considering  the  classical  canonical  form  of  A,  it  can  be  verified  in 
any  given  case  that  the  number  of  linearly  independent  rows  Y±  is  equal  to 
the  nullity  nA  of  A  —  a/,  and  is  equal  to  the  number*  of  invariant  factors 
of  A  —  A/  which  contain  A  —  a.  The  number  of  linearly  independent  columns 
Z±  is  nB,  the  nullity  of  B  —  aIy  and  is  equal  to  the  number  of  invariant 
factors  of  B  —  A/  which  are  divisible  by  A  —  a.  Hence  we  have  nAnB  rational 
square  matrices  Z^Y^  satisfying  XA  —  BX.  These  are  not  null  matrices 
and  they  are  linearly  independent,  since  any  linear  relation  between  them 
implies  a  linear  relation  between  the  nA  independent  rows  or  the  ns  in- 
dependent columns.  When  a  is  the  only  common  latent  root  of  A  and  B  and 
is  not  repeated  in  any  diit  then  there  are  nAnB  divisors  dy  each  equal  to 
A  —  a  and  each  contributing  one  of  the  independent  solutions  Z^Y^.  Hence 
we  obtain  the  correct  number  of  solutions  required  by  the  Cecioni-Frobenius 
theorem  in  this  case. 

Also,  when  A  and  B  have  several  distinct,  rational,  latent  roots  in  com- 
mon, none  appearing  as  repeated  roots  of  any  dy,  we  can  construct  a  set  of 
linearly  independent  solutions  corresponding  to  each  distinct  root.  These 
sets  are  linearly  independent,  for  any  linear  relation  between  the  solutions 
in  different  sets  implies  a  linear  relation  between  the  poles  belonging  to 
different  latent  roots.  It  has  been  proved  f  that  no  such  relation  exists. 
Furthermore,  the  HnAnB  products  ft</>  (or  Z^^),  summed  for  all  common 
latent  roots  of  A  and  J3,  constitute  a  complete  set  of  solutions  of  XA  =  BX 
in  the  case  when  every  dif  is  the  product  of  rational,  unrepeated,  linear 
factors;  each  dif  contributes  e^  solutions,  where  e^  is  the  degree  of  dijf 
This  may  be  verified  by  supposing  A  —  XI  has,  for  example,  three  invariant 
factors  with  latent  roots 

a,  a,  /?,  ft,  7,  d;     a,  a,  /?,  /?,  7;     a,/?, 
and  B  —  XI  has  four  invariant  factors  with  latent  roots 
a,  ft,  7,  7,  £,  e,  e;     a,  7,  y;     a,  7;     7. 

*  This  is  a  simple  case  of  a  more  general  result  given  by  C.  E.  Cullis,  loc.  cit.,  358-359. 
t  Cullis,  loc.  tit.,  396-398. 
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Then  for  a,  ft,  y,  <J  respectively  we  have  nAnB  equal  to  3x3,  3x1,2x4, 1x1, 
giving  T£nAnB  =21.  Also, 

6—  4.         o       —   0         £>       —  9         t>       —   1 
11  —  *»       612  —  ^>       e!3  —  ^>       "l4  —    1' 

e21  =  3,     e22  =  2,     e23  =  2,     e24  =  1, 
e31  =  2,     e32  =  1,     e33  =  1,     e84  =  0, 
giving  Se^  =  21. 

If,  however,  one  of  the  greatest  common  divisors  dy  has  repeated  linear 
factors,  then  ^LnAnB<^e^.  Thus,  if  A  —XI  has  the  single  invariant  factor 
(A  —  a)3,  and  B  —  XI  has  two  invariant  factors  (A  —  a)2,  A  —  a,  then  nA  ns  —  2 
and  Se^  =  3.  Such  cases  are  dealt  with  in  §  5. 

We  conclude  this  section  with  an  example  to  illustrate  the  method: 

Example  i: 

A  =  f     1-1-5       51,     B  =  f     5-6 

9-4 
4    -4 
L-8    -4 

There  are  two  invariant  factors  of  A  —  A/,  namely,  (A  —  1 )  (A  —  2)  (A  —  3)  and 
A  —  2,  and  two  invariant  factors  of  B  —  A/,  each  being  (A  —  2)  (A  —  3).  Hence 
.4  —  27  and  ^4  —  37  are  respectively  of  nullities  2  and  1,  giving 

0!  =  [0,  2,  8   -  7]     and     02  =  [2,  0,  0,  -  1], 

corresponding  to  the  latent  root  2,  and  03  =  [1,  2,  9,  —  9]  corresponding  to 
the  latent  root  3.  We  write  the  <p  matrices  as  row  vectors  since  all  the  other 
rows  are  zero.  Similarly,  B  —  27  and  B—3I  are  each  of  nullity  2,  giving  the 
column  vectors      ^  =  {0,  5,  2,  -  7},     ^2  =  {2>  ~  *>  °>  6} 
and  ^3  =  {5,  7,  4,  0},          ^4  =  {0,  3,  1,  -  5} 

respectively.  Hence  we  have  a  complete  set  of  six  linearly  independent 
solutions  of  XA  =  BX, 


10 

40 

-35 

4 

16 

-14 

..  -14 

-56 

49. 

4 

16 

-14" 

.   -2 

-8 

7 

12 

48 

-42. 

'5   10 

45 

-46" 

7   14 

63 

-63 

4    8 

36 

-36 

10 

.      -5 

4 

.      -2 

_ 

-14 

7. 

• 

4 

.      -2" 

-2 

1 

- 

12 

.      -6_ 

3         6 

27    -27 

1         2 

9      -9 

. 

-5   -10 

-  45       45. 
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4.    Solutions  when  d^  has  a  non-repeated  irreducible  factor  of  degree  m 
The  method  of  §  3  is  applicable  when  the  common  roots  are  irrational, 
but  as  the  solutions  would  also  be  irrational  we  now  consider  the  case  when 
di:f  has  a  non-repeated  irreducible  factor 

/(A)  =  A™  -  6j  A7"-1  -  &2  A™-2  - . . .  -  bm. 

We  take  C  of  §  2  to  be  the  ?&-rowed  square  matrix  with  the  m-rowed 
square  matrix  r  , 

1      b™ 


in  the  leading  position  on  the  principal  diagonal  and  zeroes  elsewhere.  We 
find  that  <p  has  its  last  n  —  m  rows  zero,  and  its  first  m  rows  Tlt  F2, . . . ,  Ym  are 
given  by  __ 

— 


from  which  it  readily  follows  that 


Also,  from  §  1,  each  Y  satisfies  Yf(A)  ~  0,  and  since  /(A)  is  irreducible, 
Y  cannot  be  annihilated  by  right-hand  multiplication  by  any  polynomial 
in  A  of  degree  less  than  m.  It  seems  most  convenient  to  derive  the  rows 
successively  from  the  mth  row. 

Since  Ck<f>A  =  CCk<f>,  from  any  solution  <f>  of  (f>A  =  C<f>  we  can  obtain  a 
set  of  matrices 


which  will  also  be  solutions.    Further,  these  solutions  will  be  linearly 
independent,  for  a  linear  relation 


leads  to  (f>(lt  +  12A  +  13A*+  ...  +lmAm~l)  =  0, 

which  necessitates  the  annihilation  of  each  row  of  <j>  by  a  polynomial  in  A 

of  degree  less,  than  m. 
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Similarly,  consideration  of  Bi/r  =  tyC  leads  to  ft  as  a  square  %-rowed 
matrix  with  its  last  n  —  m  columii3  zero  and  its  first  m  columns  Zlt  Z2,  .  .  .  ,  Zm 
satisfying  z,  =  5Zlf  Zt  =  BZ*  ...,  Zm  =  BZm_1 

and  bm  Zl  +  bm_!  £2  +  6ra_2  Z3+...  +  b1Zm  =  BZm  , 

from  which  we  find  that  each  column  Z  satisfies  f(B)  Z  =  0.  Since  f(B)  is 
irreducible,  Z  cannot  be  annihilated  by  left  multiplication  by  a  polynomial 
in  B  of  degree  less  than  m.  It  follows  that 

^,     ^(7,     f<72,     ...,     vW1"1 
constitute  a  set  of  m  linearly  independent  solutions  of  Bfr  =  i/rC. 

We  are  thus  led  to  a  set  of  products  i/rCk(f>,  where  k  =  0,  1,  2,  .  .  .  ,  2(m  —  1), 
each  being,  by  §2,  a  rational  solution  of  XA  =  BX.  Of  these, 


will  be  linearly  independent,  for  they  can  be  written  as* 


any  linear  relation  between  them  involving  the  annihilation  of  i/r<f>  by  post- 
multiplication  by  a  polynomial  in  A  of  degree  less  than  m;  since  ^0  can  be 
considered  as  a  column  matrix  in  which  each  term  is  a  linear  function  of 
Y1}  ...,  Ym  with  scalar  coefficients,  it  is  impossible  to  annihilate  such  a  func- 
tion by  a  polynomial  in  A  of  degree  less  than  m.  Hence,  corresponding  to 
the  factor/(A)  ofd^,  we  obtain  m  linearly  independent  solutions  of  XA  =  BX  . 

We  note  that  ^  and  ^  are  respectively  rectangular  nxm  and  mxn 
matrices  in  effect,  for  ty  may  be  written  as  [Z1}  Z2,  ...,  Zrn]  and  <f>  as 
{Fl5  F2,  ...,  T^J,  and  their  product  i/rcf>  is  a  square  ?i-rowed  matrix  of  rank  m. 
It  may  be  verified  directly  that  i/r<f>A  —  Bi/rfi,  using  the  recurrence  relations 
between  the  row  vectors  Y  and  the  column  vectors  Z. 

In  general,  /(A)  will  be  a  non-repeated  factor  of  more  than  one  d^.  The 
number  of  linearly  independent  sets  of  m  solutions  <j>  offiA  —  C(j>  is  equal  to 
the  number  nA  of  invariant  factors  of  A  —  A/  which  contain  /(A),  since  the 
total  number  of  linearly  independent  rows  satisfying  Yf(A  )  =  0  is  equal  to 
mn^,  the  nullity  of  f(A).  The  number  of  linearly  independent  sets  of  m 
solutions  of  B\lr  =  tyC  is  nB,  the  number  of  invariant  factors  of  B  —  A/ 
which  contain  /(A).  Hence  we  can  obtain  nAnB  sets,  each  of  m  products 
i/rCk<j>  (k  =  0,  1,  2,  ...,ra—  1),  corresponding  to  /(A),  each  product  being  a 
rational  solution  of  XA  =  BX.  We  have  shown  that  there  is  no  linear 
relation  between  the  m  products  in  any  set.  Any  such  relation  between 
products  in  different  sets  is  ruled  out  by  the  linear  independence  of  the 
mnA  solutions  of  <f>A  =  C<f>  or  of  the  mnB  solutions  of  Bi/r  ==  i/rC. 


*  They  may  also  be  written  as  TJS<J),  Bi/f(fi,  Bzi/r</>, . . ., 
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Further,  when  there  arise  several  irreducible  functions  /(A)  of  various 
degrees,  the  sets  of  solutions  corresponding  to  the  different  functions  will 
all  be  linearly  independent  since  they  are  annihilated  by  different  irreducible 
polynomials  in  A  or  B.  Hence,  corresponding  to  every  d^  of  degree  e^, 
consisting  of  a  product  of  irreducible  non-repeated  factors,  there  will  be 
e^  linearly  independent  rational  solutions  of  XA  =  BX,  the  number 
required  by  the  Cecioni-Frobenius  theorem. 

The  case  when  d^  contains  a  repeated  irreducible  factor  is  considered 
in  §5. 

mple  ii: 

-212       4' 
220-4 
-2126 
L-l     0     1        2. 

The  invariant  factors  of  A  —  A/  are  (A2  —  2A  —  2)  (A  4- 1)  and  A  + 1 ;  those  of 
B  -  A/  are  A2  -  2A  -  2  repeated.  If /(A)  =  A2  -  2A  -  2,  then 


Example  ii: 

A  = 

"-2 

1 

2 

2" 

2 

-4 

-8 

-6 

0 

1 

3 

2 

.-2 

2 

4 

3. 

4  _2  -2  -4 
-4446 
-2112 

6    _4    _4    _7j 


and    f(B)  =  0. 


As  a  row  vector  satisfying  Yf(A)  —  0  we  may  take  Y2  =  [1,0,2,0],  from 
which 

and     Cfa  =&A  =  f     2     0     4 
-238 


^=[--4     3     4     61 
[_     1     0     2     OJ 


01 
6J 


are  linearly  independent  solutions  of  <j>A  =  O0.  As  a  column  vector  satis  - 
fying/(J5)  Z  =  0  we  may  take  Z±  =  {1,  0,  0,  0),  giving 


1 

-2 


000 
2    -2    -1 


-2 
2 


2    -2   - 
4   -4   -2 


so  that  corresponding  to  dn  we  have  two  solutions  of  XA  —  BX,  viz. 


and 


'-63        .6" 

, 

2.4. 

-2      .    -4      . 

.-1      .    -2      ... 

^"i0i  = 

'     6    -6   -12 

-12" 

-4       6       16 

12 

4   -6   -16 

-12 

.2-3      -8 

-6. 
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Corresponding  to  d12  we  have  fa  and  Cfa  as  above,  and  the  first  column  of 
^2  may  be  taken  as  (0, 1,  0, 0},  giving 

jo    i    o    o^  _  n    2    i    o| 

^2  =  \1     2     1     OJ      an        ^2      =      ^2  ~~  \2     6     2     OJ' 
from  which  we  obtain  two  solutions 


and 


faCfa  =  fafaA  = 


-1.2." 

-2386 

1.2. 

'iMi  = 

"-23        8        6" 

-2     6     20     12 

-2386 

which  complete  the  set  of  four  required  in  this  example. 

5.    Solutions  when  dtj  has  repeated  factors 
Ifdy  has  an  irreducible  factor  /(A),  of  degree  m,  repeated  k  times,  e.g. 


then  we  take  (7,  as  in  §  4,  to  be  the  %-rowed  square  matrix  with  the  m^-rowed 
square  canonical  matrix  in  the  leading  position  on  the  principal  diagonal 
and  zeroes  elsewhere.  To  bring  this  case  into  line  with  the  method  of  §  4, 
we  have  to  show  (a)  that  there  is  a  set  ofmk  linearly  independent  solutions 
of  XA  =  BX  corresponding  to  each  dtj  containing  {/(A)}*,  (b)  that  sets 
obtained  from  different  dy'a  are  linearly  independent,  (c)  that  if  various 
dy's  contain  other  powers  fc1?  kz,  etc.,  of  /(A),  the  sets  of  mklt  mk2,  etc. 
solutions  will  also  be  linearly  independent,  and  (d)  that  these  solutions 
obtained  from  powers  of  /(A)  are  linearly  independent  of  any  solutions 
obtained  from  powers  of  other  irreducible  functions  of  A  occurring  as  factors 
of  some  of  the  dy'a. 

As  in  §4,  each  $  consists  of  mk  rows,  each  satisfying  Y{f(A)}k  —  0.  We 
choose  Ymk  so  that  Ymk{f(A)}k~l^Q.  Then  the  remaining  rows  are  obtained 
successively  from  Ymk  as  in  §4,  and  are  linearly  independent  since  any 
linear  relation  between  them  involves  annihilation  of  Ymk  by  post-multi- 
plying by  a  polynomial  in  A  of  degree  less  than  mk.  It  is  therefore  possible 
to  find  a  matrix  U,  of  n  rows  and  mk  columns,  such  that  (f>U  is  the  square 
rafc-rowed  unit  matrix.  Similarly,  ^  consists  of  mk  columns,  each  satisfying 
{f(B)}kZ  =  0.  We  choose  Z^  so  that  {f(B)}k~l  Z^  =J=  0,  and  derive  the  re- 
maining columns  successively  from  Z±  as  in  §  4.  These  columns  will  be  linearly 
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independent,  and  so  there  exists  a  matrix  V,  of  n  columns  and  mk  rows, 
such  that  Vi/r  is  the  square  m&-rowed  unit  matrix.  The  mk  products 


will  all  be  solutions  of  XA  —  BX,  and  will  be  linearly  independent,*  since 
any  linear  relation  between  them, 


)  <!>  =  o, 

leads  to  Vi/r(lQ  +  /x  G  +  12  <72  +  .  .  .  +  l^^  Cmk~l)  <f>  U  =  0, 

which  involves  the  vanishing  of  a  polynomial  in  C  of  degree  mk  —  1  ,  since 
Vijr  and  <j>U  are  unit  matrices.  The  canonical  matrix  C  cannot  satisfy  any 
equation  of  degree  less  than  mk.  We  have  thus  proved  (a). 

To  prove  (6)  we  have  to  consider  the  rows  of  0  in  greater  detail.  The  case 
of  repeated  factors  differs  from  §  4  in  that  certain  linear  functions  of  the  mk 
rows  are  annihilated  by  post-multiplying  by  certain  polynomials  in  A  of 
degree  less  than  mk.  None  of  the  m  rows 

V  V  V 

-Lmki       -^mk—l'       •••>       -'•mk—  m+I> 

nor  any  linear  function  of  them,  can  satisfy  Y{f(A)}k~1  —  0,  for  this  would 
involve  annihilation  of  Ymk  by  a  polynomial  in  A  of  degree  less  than  mk, 
in  virtue  of  the  recurrence  relations  of  §  4.  The  next  row,  Ymk_m,  is,  however, 
such  that  T  i  v  ,j  v  ,  ,/  v  .  7  v 

A/!  —  to^mfc  +  H^mfc-1  +  •  •  •  +  fcm-l  *  mk-m+1  +  imImk~m 

will  satisfy  L1{f(A)}k~~'L  =  0,  if  the  scalar  constants  lt  are  chosen  so  that 
lQ  +  l1(A-bl)  +  lz(A*-blA-b2)+...+lm(A™-blA™-*-...-bm) 

is  identically  equal  to/(u4).  Similarly,  there  exists  a  linear  function  of 

Y          V  V  V 

-*wfc>       Jwfc—  1>       •••?  J-mk—m>      ^mk—m—I' 

which  satisfies  L{f(A)}k~1  =  0,  and  so  on  for  each  row  up  to  ^fc-am+i- 
There  can,  however,  be  no  linear  function  of  these  2m  rows  which  will  satisfy 
L{f(A)}k~z  =  0,  because  of  the  initial  choice  of  Ymk  as  not  satisfying 
Ymk{f(A)}k-1  =  0.  Thus  the  first  m  rows  of  (j>  (starting  from  1^)  give  m 
vectors  satisfying  L{f(A)}k-1  —  0  and  not  satisfying  L{f(A)}k~2  =  0;  in  the 
same  way  the  third  set  of  m  rows  of  <j>  (starting  from  Ymk_2m)  leads  to  m 
vectors  satisfying  L{f(A)}k~2  =  0  and  not  satisfying  L{f(A)}k~*  =  0,  and 
so  on  for  the  remaining  rows  of  ^.  Each  of  the  mk  linearly  independent  rows 
of  0  defines  a  vector  satisfying  L{f(A)}k  =  0,  those  in  the  rth  set  not  satis- 
fying L{j(A)}k-*  =  0. 


*  This  proof  of  the  independence  of  the  mk  solutions  was  suggested  by  Mr  D.  E. 
Littlewood. 
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Now  suppose  {/(A)}fc  occurs  in  two  invariant  factors  a±  and  az  of  A  —  XI 
and  in  bl  of  B  —  XI.  Then  both  dn  and  <#21  contain  {/(A)}fc,  and  we  should 
obtain  mk  independent  solutions  of  XA  —  BX  corresponding  to  each. 
We  construct  those  corresponding  to  dlt  by  determining  ^  and  (f>1  as  already 
described,  and  forming  the  products 

fxC^i     (*  =  0,  l,...,w&-l). 

We  next  form  ^2>  corresponding  to  d2i>  by  choosing  an  initial  Ymk  satis- 
fying Ymk{f(A)}k  =  0  and  not  satisfying  Ymk{f(A  J}*-1  =  0,  and  at  the  same 
time  linearly  independent  of  the  rows  of  ^r  That  this  is  possible  is  known 
from  theorems*  on  the  nullity  of  {f(A)}k,  which  are  equivalent  to  the 
statement  that  we  can  determine  a  complete  set  of  unconnected  solutions 
of  L{f(A)}k  =  0  of  which  mnA  are  solutions  of  L{f(A)}k  =  0  and  not  of 
L{f(A  )}k~l  =  0,  where  nA  is  the  number  of  invariant  factors  of  A  —  XI 
which  contain  {f(A)}k.  It  is  thus  possible  to  choose  the  initial  row  Ymk  of 
^a  to  satisfy  the  above  conditions,  and  when  this  is  done  we  obtain  a  further 
set  of  mk  linearly  independent  solutions 


of  XA  =  BX.  Similarly,  when  {/(A)}*  occurs  in  a  third  invariant  factor  of 
A  —  XI,  we  can,  by  proper  choice  of  the  initial  row  of  <f>3,  obtain  another  set 
of  mk  linearly  independent  solutions 

^0^3     (i  =  0,l,...,mfc-l). 

Discussion  of  the  columns  of  ^  follows  the  same  lines  as  that  of  the  rows 
of  <j>,  and  it  will  follow  that  by  proper  choice  of  the  various  0's  and  ^'s  as 
regards  the  independence  of  the  initial  rows  Ymk  of  the  0's  and  the  initial 
columns  Z±  of  the  ^'s,  every  dif  containing  {/(A)}&  gives  a  set  of  mk  un- 
connected solutions  and  the  various  sets  are  also  independent. 

In  the  same  way,  d^-'s  containing  other  powers  klt  k2,  etc.,  of  /(A)  will 
give  rise  to  the  correct  number  ra&1}  ra&2,  etc.  of  solutions  if  <j>  and  ^  are 
properly  chosen  in  each  case.  The  $  (or  i/r)  corresponding  to  any  invariant 
factor  must  have  its  initial  row  Ymk  (or  column  Z^  linearly  independent 
of  any  rows  (or  columns)  used  for  another  invariant  factor.  That  there  are 
sufficient  unconnected  rows  (or  columns)  for  the  purpose  is  known  from  the 
theorems  on  the  nullity  of  powers  of/(^4)  and/(#). 

Powers  of  other  irreducible  polynomials  may  also  occur  in  some  of  the 
dy's.  The  solutions  corresponding  to  these  will  clearly  be  independent  of 
those  obtained  for  powers  of  /(A),  since  the  various  sets  have  as  annihilating 
polynomials  powers  of  different  irreducible  functions  of  A  or  B. 


*   Cullis,  loc.  cit.,  362,  387. 
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6.   Solutions  involving  the  coefficients  of  an  algebraic 
equation  satisfied  by  A  and  B 

The  method  already  outlined  gives  the  complete  set  of  rational  commu- 
tants  in  any  given  case.  We  now  indicate  a  method  of  obtaining  sets  of 
rational  commutants  when  A  and  B  satisfy  the  same  algebraic  equation. 
The  matrices  obtained  are  not  all  linearly  independent,  but  a  linearly 
independent  set  may  sometimes  be  picked  out  by  inspection  in  numerical 
cases  of  comparatively  low  order. 

If  the  minimum  functions  of  A  and  B  are  divisors  of 


then  A  and  B  will  satisfy  the  algebraic  equation  Fm(k)  =  0,  and  we  may 
choose  C  appropriate  to  Fm(A.)  instead  of  an  irreducible  factor  (or  power  of 
an  irreducible  factor)  of  one  of  the  d^s.  As  in  §  4,  we  choose  Ym  to  satisfy 
Y{Fm(A)}  =  0,  which  means  that  Ym  is  arbitrary.  We  can  take  it  to  be  any 
one  of  the  n  unit-row  vectors  /y  with  unity  in  the  ith  position  and  zeroes 
elsewhere.  Having  constructed  fa  with  /x  as  its  mth  row,  we  do  not  proceed 
as  in  §  4  to  find  (7^15  (720i»  etc.,  but  we  form  n  linearly  independent  matrices 
fa  each  with  its  mth  row  equal  to  It.  From  the  mode  of  construction  of  the 
successive  rows  it  is  clear  that  the  remaining  non-zero  rows,  Ym_lt  Fm_2,  .  .  .  ,  Y± 
of  the  ^-matrices  are  expressed  in  terms  of  some  of  blt  b2,  ...,  bm  and  the 
elements  of  A.  Thus,  when  m=  3, 

#,  =  {/,(4»-M-&«).  /,(4-&i),  /„  o,  ...,  o}. 

Similarly,  the  first  column  of  ^,  satisfying  {Fm(B)}  Z  —  0,  is  arbitrary 
and  may  be  taken  to  be  any  one  of  the  n  unit-column  vectors  J^.  We  form 
n  linearly  independent  matrices  ^  each  with  Jt  as  its  first  column.  Thus, 
for  ra  =  3, 

fa  =  [Ji9  BJit  B*Jt,  0,  ...,  0]. 

Hence  we  obtain  n2  solutions  of  XA  =  BX  of  the  type  fttfij.  They  are  not 
all  linearly  independent.  The  number  of  unconnected  solutions  obtained 
depends  on  the  relations  between  the  minimum  functions  of  A  and  B 
and  Fm(\),  and  has  to  be  investigated  in  any  particular  case. 

Solutions  obtained  in  this  way  have  the  merit  of  being  written  down  by 
comparatively  simple  rules  in  terms  of  the  elements  of  A  and  B  and  the 
coefficients  of  Fm(\).  For  example,  when  A  and  B  satisfy  the  same  quadratic 
equation,  we  have 

^  =  W,A^,0,...,0]    and 
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This  gives  a  simple  rule  for  writing  down  Xijt  viz.  X^  is  the  sum  of  two 
matrices,  the  first  having  its  ith  row  equal  to  the  jth  row  of  A  —  b{  with 
zeroes  elsewhere,  and  the  second  having  its  jth  column  equal  to  the  ith 
column  of  B  with  zeroes  elsewhere. 


As  an  example,  take 


A  = 


"     3 

1 

1 

r 

3 

-1 

-3 

-3 

1 

1 

3 

i 

.     2 

2 

2 

4. 

B  = 


Each  matrix  satisfies  A2  —  5A  +  6  =  0,  the  remaining  invariant  factors  of 
A  -A/  being  A -2,  A -2,  and  of  B-XI,  A2-5A+6.  By  the  above  rule, 
we  find 


x, 


3 

9 
4 

-8 
1      1 


1 


1 


1      1      1 


-6 
-6 

-4  .       . 

-4  .       . 

.      .  -6      .' 
11-61 

.      .  -4      . 

..      .  -4      .. 


-3    -1    -3    -3 
9 
4 

.      .    -8 
222       4 
.      .      .        9 
.       .       .        4 

..      .      .    -8. 

.      -6 
-3    -10    -3    -3 

.      -4 

,      .      -4 

'.  ,    .       .    -6" 

22     2-5 

4. 

..       .       .     -4J 


these  being  seen  to  be  independent  by  inspection  of  the  last  two  rows.  The 
remaining  eight  matrices  Xy  (i  =  3,  4;  j  =  1,2,  3,  4)  are  linear  functions  of 
these,  but  since  eight  solutions  are  required  by  the  Cecioni-Frobenius 
theorem,  our  solution  is  complete. 
Similarly,  when  m  =  3,  we  have 


In  a  numerical  case  we  could  obtain  X^  by  adding  three  matrices,  the  first 
having  its  ith  row  equal  to  the  jth  row  of  A^  —  blA  —  b2  with  zeroes  else- 
where, the  second  being  the  product  of  the  ith  column  of  B  by  the  jth  row 
of  A  —  bl9  and  the  third  having  its  ^'th  column  equal  to  the  £th  column  of 
52,  with  zeroes  elsewhere. 
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Applying  this  rule  to  Example  ii  of  §  4,  we  find  that 

A3_A2-4A-2  =  (A2-2A-2)(A+1)  =  0 
is  satisfied  by  A  and  B,  and  having  written  down 


.    -1        .    -2" 
-2    -2   -4 
-2224 

4   -2        .    -6J 


A-l  =   f-3        I        2       2' 

2    -5    -8   -6 
122 

.-2       2       4       2J 


-224  8' 

4     6      .  -8 

-4     2     6  12 

L-2      .      2  6J 

we  can  write  down  sixteen  solutions  of  XA  =  BX,  these  being  of  the  type 


L-2    -2    ~4 


+ 

'     4" 

[2    - 

5    -8    -6]  + 

8      .      ." 

-4 

.    -8      .      . 

6 

.      12      .      . 

.     2. 

6     .      .. 

•     8-12-32 

-24" 

f 

-8       12       32 

24 

12    -18    -48 

-36 

.     2      -6-20 

-12. 

When  A  and  B  satisfy  the  same  quartic  equation  we  find  in  the  same  way 


and  a  rule  can  be  formulated  in  which  Xtj  can  be  written  down  as  the  sum 
of  four  matrices. 

In  general,  when  A  and  B  satisfy  Fm(\)  —  0  (m  <  ri),  then 


m—  1 

{J  =  E 


m—l 

=  2 

p=0 


where 
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7.    Solutions  of  XA  =  A  X  and  XA  =  A'X 
when  A  is  an  elementary  matrix 

It  is  well  known  that  when  A  is  elementary,  its  minimum  and  character- 
istic equations  being  identical,  the  solutions  of  XA  =  AX  are  polynomials 
in  A.  If  Aft  (A)  is  the  expansion  as  a  polynomial  in  A  of  the  cofactor  of  a^  —  A, 
when  i  =  j,  or  of  a^  when  i  =£j,  in  |  A  —  XI  \  ,  then  the  solutions  X^  found  by 
the  method  of  §  6  can  be  written  as  (  -  l)n+*Atj(A).  Thus,  when  n  =  3, 

X^  =  AU  -  («22  +  a3 


^13  =  -4i3  +  a3i^>     etc-> 

where  Ay  is  the  numerical  value  of  the  cofactor  of  a^  in  the  determinant 
of  A.  As  a  matrix,  X^  may  be  written 


21 

a 


If  A'  is  the  transpose  of  A,  solutions  of  XA  =  A'X  are  not  in  general 
expressible  as  polynomials  in  A,  but  the  solutions  obtained  by  the  method 
of  §  6  are  closely  similar  in  form  to  those  of  XA  =  AX.  They  have  been  given 
in  some  detail  in  an  earlier  paper,*  where  the  similarity  was  observed. 

University  College 
Swansea 


*  H.  O.  Foulkes,  "  Collineatory  transformation  of  a  square  matrix  into  its  trans- 
pose", Journal  London  Math.  Soc.  17  (1942),  70-80. 
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ON  THE  SUMMATION  OF  MULTIPLE  FOURIER  SERIES.   I 
By  K.  CHANDRASEKHARAN 

[Received  26  May  1945.— Read  28  June  1945] 

1-1.  This  paper*  is  the  first  of  a  projected  set  of  papers  devoted  to  the 
study  f  of  multiple  Fourier  series.  While  single  Fourier  series  and,  to  a  lesser 
extent,  double  Fourier  series  have  been  investigated  in  great  detail,  multiple 
Fourier  series  have  not  received  the  same  degree  of  attention.  It  will  be 
interesting  to  know  how  far,  and  in  what  way,  the  results  on  ordinary 
Fourier  series  can  be  extended  to  the  case  of  multiple  Fourier  series.  Among 
the  various  possible  methods  of  considering  the  behaviour  of  this  series,  the 
one  adopted  by  S.  Bochner(l)  of  studying  it  as  a  simple  series  by  taking 
"spherical"  partial  sums,  instead  of  the  usual  rectangular  sums,  is  specially 
noticeable,  not  only  because  it  leads  to  elegant  results,  but  also  because  it 
accords  with  the  method  of  treatment  of  general  classes  of  expansions  in 
eigew-functions  of  which  multiple  Fourier  series  can  be  viewed  as  a  special 
case.  While  the  results  on  the  summation  of  expansions  in  eigre^-functions  (7) 
are,  in  a  sense,  more  general,  they  are  much  less  precise  than  the  corres- 
ponding results  on  multiple  Fourier  series;  in  the  latter  case,  however,  it 
should  be  observed  that  we  impose  the  condition  of  periodicity  on  the 
function  that  is  expanded. 

1-2.  In  this  paper  we  are  concerned  with  the  problem  of  summability  of 
multiple  Fourier  series.  We  connect  the  Riesz  summability  of  the  series  at 
a  point  with  the  behaviour  of  the  generalized  "spherical  mean"  of  the 
function  near  the  point.  In  one  dimension,  this  summability  will  turn  out  to 
be  equivalent  to  Cesaro  summability,  and  the  results  of  Hardy,  Little  wood 
and  Bosanquet(5,  2)  connecting  the  Cesaro  summability  of  Fourier  series 
with  the  Cesaro  mean  of  the  function  are  obtainable  as  corollaries  therefrom. 


*  Added  22  October  1947.— The  results  of  this  paper,  as  well  as  those  of  the 
succeeding  paper,  were  announced  in  Proc.  Indian  Acad.  Sci.  xxiv,  2,  A  (1946), 
229-232. 

f  My  thanks  are  due  to  Dr  S.  Minakshisundaram,  by  whom  I  was  prompted  to 
study  this  problem. 
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Let  f(x)  =  /(#!,  ...,xk)  be  a  function  of  the  Lebesgue  class  L,  periodic 
n  each  of  the  k  variables,  and  having  the  period  2ir. 


o^yE  f+"-  rV(*) 

£m)      J   —It  J   —IT 


Che  series  Soni  ^    >nkeKnlxl+...+nkxk)  js  caueci  the  multiple  Fourier  series  of 
he  f  unction  /(cc),  and  we  write 

f(x)  ~  Xanit_.nke«»^+~  +»***>.  (1-21) 

(1-22) 


lenote  the  "spherical"  partial  sum  of  the  series  (1-21);  that  is,  we  shall 
jonsider  it  as  a  simple  series 

i     S  ani          c««i*i+  »•+****>,  (1-23) 

j=0  v-J?j 

yhere  J?^  is  the  sequence  of  all  integers  that  can  be  represented  as  a  sum  of 
;  squares. 

Let  S^l-X  .....  njf  ««".-!+  •••+»»*»>,  (1-24) 


o  that  Sj^  is  the  Riesz  mean  of  the  series  (1'21),  of  type  vz  and  order  S.  If 

lim  S8R  exists  and  is  finite,  then  the  series  (1*21)  will  be  summable  (v2,  S). 

e->oo 

"f  k  =  1,   this  becomes  summability   (n2,  8),   which   is   equivalent  (4)   to 
ummability  (C,d). 

Let  f(x,  t) 


v^here  or  denotes  the  unit  sphere  £|  +  .  .  .  +  £|  =  1  and  rfo-g  its  (fc  —  1  )  -dimensional 
'•olume  -element,  so  that  f(x,  t),  which  exists  almost  everywhere,  as  a  function 
>f  the  single  variable  t,  is  integrable  in  every  finite  interval,  and  represents 
he  (k—  1)  -dimensional  "spherical  mean"  (except  for  the  factor  2Kfc-2>r(£&)) 
>f  the  function  f(x)  over  a  sphere  of  radius  t  with  centre  at  the  point  x.  If 
;  =  1,  then  , 


dy,  (1-25) 

vhere  dy  is  the  fc-dimensional  volume-element  &ndp  >  0,  2  (l/i  —  x€)2  —  s*  ^  ^2. 

i 

*  c,  clf  c2,  etc.,  are  some  constants. 

02 
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If  p  =  0,  we  define  f0(x,  t)  =  f(x,  t) ;  fp(x,  t)  =fp(t)  is  called  the  "spherical 
mean  "  of  the  function  f(x),  of  order  p.  This  exists  for  almost  all  t  if  0  <  p  <  1 , 
and  is  absolutely  continuous  in  every  finite  interval  excluding  the  origin 
if  p  >  1 . 

If  fp(t)  tends  to  a  limit  as  tf-^-0,  then  f(x)  is  said  to  have  a  "mean  limit" 
of  order  p  at  the  point  x.  If  k  —  1, 

-^2J         f(x,8)d8, 

so  that  whenever  lim/J?(<)  exists,  the  pih  Cesaro  mean  of  the  function  f(x,  t) 

tends  to  a  limit,  and  conversely. 

The  connexion  between  multiple  Fourier  series  and  Fourier  series  of 
eigen  -functions  will  be  clear  if  we  consider  the  boundary- value  problem, 

with  the  condition  that  u(x)  is  periodic  in  each  of  the  k  -variables,  with 
period  2?r.  It  is  easily  seen  that  the  elementary  exponentials 


where  (nk)  are  all  integers,  form  a  complete  set  of  regular  solutions  of  (1-26). 
Since  the  eigen  -values  are  given  by  A  =  TI|+  ...  -f  n|,  the  method  of  writing 
the  series  (1-21)  in  the  form  (1-23)  is  simply  the  arrangement  of  the  terms 
of  the  series  (1-21 )  according  to  the  order  of  magnitude  of  the  eigen-  values  A. 

1-3.   We  shall  state  here  some  of  the  notations  and  formulae  which  will 
be  frequently  used: 


(1-31) 


It  is  easily  verified  that 


if  p  -f  q  ^  1,  and  hence  it  follows  that 
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Let  Jp(x)  denote  the  Bessel  function  of  order  /i.  Let 

rf(x)  =  Jf(x)ix". 

Jf(x)  =  O(x>')  as  x-*0;     J^x)  =  O(x~t)  as  a;-»oo,  (1-34) 


r°°  T  (t\ 
Jo  r=» 


for     (x/K.  +  f,  (1-36) 


/       ^w-/*-! 

• 


-^^     =  2^^(7^ 

and  =0,     if    n><o.(3)  (1-37) 

2-1.    Preliminary  lemmas 

LEMMA!:  f  V-1  1  /(j»,  *)  |  dt  =  O(w&),     00    ^->oo,  (2-11) 

Jo 


),    /or    ^?^1,     a*     u-*ao,  (2-12) 

(f>p(u)  =  0(u*P+k~2),    for    p^ly     as    u-+oo.  (2-13) 

(2*11)  has  been  proved  by  Bochner((l),  189  last  line);  (2-13)  follows  from 
(2-12)  in  virtue  of  (1-31).  (2-12)  is  proved  thus: 


-0(1),     by  (2-11). 
Now,  using  the  formula  (1*33),  we  easily  prove  (2-12). 

LEMMA  2.    // 


and  Gp(u)  =  f  W^^+fc  -i  |  gp(t)  |  rf^  w^ere     grp(0  =  fp(t)  -  1, 

then  ®f(u)  =  O(^2^-ffc),     as     ^->oo,  (2-14) 

Fp(u)  =  O(u*v+k),     as    ^->oo,  (2-15) 

Op(u)  =  O(u2*>+k),     as    ^->oo.  (2-16) 
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(2-14)  and  (2-16)  easily  follow  from  (2-15)  which  is  proved  thus: 
J  V+*-i  |  fp(t)  \dt=0  1  JJ  (u*  -  s*F  8*-*  |  /(a?,  s)  |  efe  j 

-  0(u*v)  fV-1  1  /(a,*)|«fe 
Jo 

=  O(u2v+k),     by  (2-11). 
2«  2.   Bochner  has  proved  the  following  fundamental  result  ((  1  ),  176,  1  89)  : 

8&x)  =  2T(*  +  1)  J2*  f°V-y(*,  t)VMk(tR)dt,  (2-21) 

Jo 

provided  #>  J(&-1). 

0,  and  d>%(k—I),  we  have 


rtk-lf(x,t)VMk(tR)dt  -  o(l),     as     R-+OO,       (2-22) 
J  i 

uniformly  for  all  x.  For, 

by  (1-34) 


Combining  (2-21)  and  (2-22),  we  obtain,  for  £>£(&-  1), 

>8fJ(aj)  =  2«r(a+l)JR*  f%*-y(«,OTJ+u(«-B)c«  +  o(l),  (2-23> 

Jo 

which  shows  that  7?ie5s  summobility  of  type  vz  and  order  S  >  %(k  —I)  is  a  local 
property  for  multiple  Fourier  series,  when  summed  ''spherically".  That  this 
property  ceases  to  hold  when  £  =  i(&-  1)  and  k>  1,  has  been  shown  by 
Bochner  ((1),  193,  Th.  vn). 

3-1.    Fundamental  relations.  In  this  section,  we  generalize  the  formula 
(2-21)  and  prove  its  reciprocal. 

THEOREM  1  .  //  h  is  the  greatest  integer  less  than  p,  where  p>Q,  then 


provided  that  8>h+  J(Aj—  1). 


215 
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Proof.   The  case  p  =  0  of  (1-35)  gives  Vfa)  =  -xV^(x),  and  so,  inte- 
grating the  right  side  of  (2-21)  by  parts  h  +  1  times,  we  get 


(3-12) 


provided  8  >  h  +  \(k  —  1  ),  on  using  lemma  1  at  each  stage  of  partial  integra- 
tion. If  p  is  an  integer,  h+1  —  p,  and  (3-12)  leads  directly  to  the  result 
(3-11).  If  p  is  not  an  integer,  substituting  for  <fih+i  in  terms  of  <f>p  as  in  (1-32), 
and  setting  /?  =  S  +  $k  +  h+  1,  we  write  (3-12)  as 


_.  _ 
—  19+1) 


f°°  r°° 

2  I        <j,A    (o\fjo   I       //2 
Jo  J*  ~ 


since 


/*»l  /*  00 

lim       a<l>p(8)d8\    (t*-s2)h-*>tVp(tfi)dt  =  0,     for 

TO->ooJo  Jm 

which  is  proved  as  follows:  for  s<m,  and  a  fixed  R  >  0, 


f  °°(*2  - 

J  m 


^  (m2  -  52)7t  -P  max 


f  m 

J  m 


so  that 


Now  (3-13)  can  be  written  as 


1)  l» 


,     by  (1-35), 
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From  this  formula  it  follows  that  a  necessary  and  sufficient  condition  for 
the  summability  (y2, 8)  of  the  multiple  Fourier  series  is  that 

lim 

should  exist,  where  8  >p  +  $(k  —  1). 

Remarks,  (i)  In  the  above  theorem,  if  0<p<  1,  we  naturally  interpret 

(ii)  If  p  ^  I ,  we  can  actually  prove  a  relation  which  is  sharper  than  (3-11). 
The  sharper  form  will  not,  however,  be  needed  in  this  paper,  but  it  leads  to 
an  interesting  theorem  which  is  connected  with  the  question  of  absolute  sum- 
mability. For  that  reason  it  will  be  proved  as  a  lemma  in  the  second  paper 
of  this  series.* 

THEOREM  II.  If  p  >  1 ,  and  8  >  %(k  —  1 ),  then 


Proof.  From  (1-24)  it  follows  trivially  that  S"R=  O(Rk),  and  so  the  integral 

J  =  y* 
converges  for  y  >  2#-hf.  Hence,  by  (2-21), 

J  =  2T(<?+l)2/2fi+2  [™  R*+**+*Vv(Ry)dR  (  ~  tk~*f(x,t 

Jo  r  Jo 

=  *~f^£~  J"  «"I-'-V(^,  0  *  J"  R^>+l^Js+tk(tB)  Jy(yR)  dR, 
as  the  double  integral  is  absolutely  convergent.  Hence,  by  (1-37), 


forp  =  7-^-^, 


*  Immediately  following  this  paper  in  these  Proceedings. 
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3-2.    Theorems  on  summability 
THEOREM  III.  Iffp(t)  ->last-+0,  then 

lim  S&x)  =  2«fc-2>r(p)/,    for    d>p  +  \(k-  1). 

.R-»oo 

Proof.  Let 

MO  -/,(')-», 

/=  8sB(x)-  2«*- 


f  " 

Jo 


, 

where 


_  __ 

since  £*^  JJW-iFi+p 

Let,  for  0<9/<oo, 

°0  /2)     sayj  (3.22) 


JJ  f^-» 


=  o(l),     if    8>p  +  %(k-\).  (3*23) 

Using  (1-34), 

,  j  ,_  0/  _  1  _  \  f 
1   2l  ""     \jR*-*fc-"p+v  J  v 


Ew_r  +fl  f00  «P(«MLI 

+i<*+»|,  J,    ^+»+«*+3)J 


=  0(1),     if    8>p  +  %(k-I).  (3-24) 

From  (3-23)  and  (3-24)  it  follows  that  |  /  1  =  o(l)  for  d>p  +  \(k-  1),  from 
which  results  theorem  III. 

Remarks,  (i)  At  a  point  of  continuity  of  f(x),  we  have  S^(x)->f(x)  as 
^->oo,  for  *>  4(fc-  1),  since/(o;,i)->{2i<fc-2)r(p)}-1/(^)>  as  *->0. 

(ii)  If/(ic)  is  a  continuous  periodic  function,  f(x,  t)  converges  uniformly 


to  -Kfc_  as  t  ->  0  for  all  x\  and  hence  ^  converges  uniformly  to  /(#). 

Theorem  III  is  a  generalization  of  a  well-known  result  of  Bosanquet(2). 


218  K.  CHANDBASEKHABAN  [28  June 

THEOBEM  IV.    // 
(i)  fp+i(t)~l  =  o(I),     as 


as 


then  lim  SR(X)  =  L, 


where  8>p  +  %(k-l)  and  L  =  2 
Proof.    From  (3-21)  we  have 


r  ri/jR      r°°  "1 
+ 

LJo         Ji/Rj 

=  J1  +  J2,     say, 

(fl/jR  \ 

Rk+zP         tk+2P-i  |  ^(^  1^1  =  0(1),     by  hypothesis  (ii)  ; 


if  3  >  p  4-  4(&  —  1  ),  on  using  (2-  1  6)  in  partial  integration. 

Hence  the  multiple  Fourier  series  is  bounded  (*>2,  #),  for  £>^>4-  J(&—  1); 
by  a  well-known  result,  it  follows  that  the  series  is  either  summable  for  every 
8>p  +  i(fc—  1),  or  none  at  all  (6);  but,  by  hypothesis  (i)  and  theorem  III, 
it  is  summable  (vz,  3)  for  8  >  p  +  1  +  \(k  -  1  )  to  the  sum  2**-2>r(p)  L  Hence, 

lim  8'R(x)  -  2i<*-2>r(p),     for 


Remarks,    (i)  In  the  above  theorem,  conditions  (i)  and  (ii)  could  be 
replaced  by  i 


(ii)  Ifp  =  Oand&  =  1,  we  obtain  Lebesgue's  criterion  for  the  summability 
of  ordinary  Fourier  series. 

(iii)  Instead  of  p+  1  occurring  in  condition  (i),  we  may  have  any  q>p. 
For,  condition  (ii)  of  the  theorem  alone  implies  that  the  series  is  bounded 
(v2,  d)  for  8>p  +  %(k—  1),  so  that  for  the  summability  (v2,d)  of  the  series, 
it  is  sufficient  if  the  series  is  summable  for  some  8  and  this  will  be  the  case 
if  lim/JJ)  exists  for  any  q  >p,  by  theorem  III. 


1945]  SUMMATION  OF  MULTIPLE  FOURIER  SERIES 

THEOREM  V.  Iffp(t)  -I  =  O(t«),  as  t -> 0,  a >  0,  then,  for 


we  have 


.    tf 


,   if 


Proof.   From  (3-21)  we  have 

/  =  s'K(x)-L  = 


As  in  (3-24)  we  have     |  /2  |  =  oi 


We  may  consider  /t  as 

r~  fi/R      r 

I  =  £Rk+2p\   I       + 
1  Uo        J] 

—  /!,!  +  Iit  2,     say. 
|'M|  =  °( 


-^  '    if   ft 

) 

log  R\ 

— P —  I ,     it 

4).     ^    A 
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(3-25) 


(3-26) 
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since  Gp(t)  =  0(tk+2*>+*),     as 

by  hypothesis.  Now 


[28  June 
(3-27) 


so  that                |/1>2|  =  0h^)lW)- 

/  1  \         /  ] 
that  is                           —Oi-f^i  +  Oi-^ 

\Rp]        \-S 

and                               =  (?(^)[O(1)~ 

Q{  __L  j  f     if 

Hence                1  A,  a  1  ~ 

OJ  -^  J  ,     if 

»(¥). 

),     if 

=  O(l)  +  O(log/J),     if 
if    /? 


if 


if 


(3-28) 


Combining  (3-25),  (3-26)  and  (3-28),  we  get  the  required  result.  It  is  clear 
that  our  hypothesis  may  be  replaced  by  the  more  general  condition  (3-27). 


THEOREM  VI.  //  ->  aa,  then 

where 


if 


This  is  a  generalization  of  a  theorem  of  Szasz  (8),  and  the  proof  can  be 
easily  constructed  on  the  same  lines  as  in  our  earlier  theorems. 

3-3.    Theorems  on  the  "mean  limit"  off(x) 
THEOREM  VII.  //  S'k(x)->s  as  R-*co,  then 


provided  that  p  >  max  (1,7  —  \(k  —  3)). 

Proof.  We  prove  the  theorem  using  the  formula  (3-14), 


which  holds  only  for  p  >  1  and  $  >  \(k  —  1  ).  If  7  >  %(k  —  1  ),  we  choose  d  —  y  ; 
if  7  ^  %(k  -  1  )  choose  8  =  J(  k  -  1  )  +  0  where  6  >  0  is  arbitrary. 


1945]  SUMMATION  OF  MULTIPLE  FOURIER  SERIES  221 

Setting  /  =  c?/2*+2  f  P  +  J°°l  =  /!  +  72,     say, 


we  obtain 


=  o(l),     as     2/->0,  (3-31) 

=  o(f*+*  JJ 


=  o(l),     if    p>d-$(k-3).  (3-32) 

From  (3-31)  and  (3-32)  the  result  follows. 

This  again  is  a  generalization  of  a  theorem  of  Bosanquet(2). 

THEOREM  VIII.  //  8^(x)  -s  =  o(-^~\  5asjR->oo,0<a<25  then 


provided  that  p  >  max  (  1  ,  7  —  \(k  —  3)  +  a). 

Proof.  Let  S  and  /  have  the  same  significance  as  in  Theorem  VII.  Then 
1  implies  S^{  —  s  =  O\^}  f°r  #>y>  since  0«x<  2.  Now 


-  0(y2S+*)  =  O(y«),     since     0  <  a  <  2  ;  (3-33) 

1  =  0(^+2  j 


=  0(y«).  (3-34) 

Combining  (3-33)  and  (3-34)  we  get 

|  /  |  =  0(y*).  (3-35) 

Theorems  VII  and  VIII  are  converse  to  theorems  III  and  V. 
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3-4.  Combining  theorems  III  and  VII  we  can  state  the  following  analogue 
of  a  theorem  of  Hardy  and  Littlewood  (5)  : 

THEOREM  IX.  A  necessary  and  sufficient  condition  that  the  multiple 
Fourier  series  of  a  function  f(x)  should  be  summable*  (spherically)  at  a  point 
x  is  that  the  mean  limit  of  some  order  of  the  function  exists  at  the  point. 

Again,  combining  theorems  IV  and  VII,  we  state  the  following  theorem 
(Bosanquet  (2)): 

THEOREM  X.   // 

=  0(1),     as 


or,  in  particular,  iffp(s)  —  O(l),  then  the,  multiple  Fourier  series  off(x)  is  either 
summable  (  vz,  8)  for  every  8>p  +  \(k  —  1  )  or  for  no  8  ;  a  necessary  and  sufficient 
condition  for  it  to  be  summable  is  that  fg(t)  ->  Z,  as  t  ->  0,  for  q>p  +  l. 

In  conclusion  we  may  point  out  that  the  method  of  this  paper  applies 
mutatis  mutandis  to  multiple  Fourier  integrals  (cf.  Bochner,  (1)). 
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ON  THE  SUMMATION  OF  MULTIPLE  FOURIER  SERIES.  II 

By  K.  CHANDRASEKHARAN 
[Received  26  May  1945.—  Read  28  June  1945] 

1.  In  paper  I*  (4)  we  discussed  the  summability  of  multiple  Fourier 
series  by  Riesz's  means  of  suitable  type.  Here  we  discuss  the  absolute 
summability  of  the  series,  and  obtain  results  which  are  sharper  than  those 
on  series  of  eigenf  unctions  (3)  and  which  lead  to  results  equivalent  to  those 
of  Bosanquet(l)  in  one  dimension. 

Let  f(x)  =  f(xlt  ...,  xk)  be  a  function  of  the  Lebesgue  class  L,  having  the 
period  %TT  in  each  of  the  k  variables.  Let 

/(*)  ~  S  awi  .....  »*e«»i*i+-  +w***)> 


/„(*,«)  =  /,(<)  =  ~k  jf(y)  (i-J)1"1^ 

where  p>  0,  and  c  is  a  constant,  <fa/  is  the  ^-dimensional  volume-element  and 


where  cr  denotes  the  unit  sphere  £f  +  .  .  .  +  g|  =  1  and  dcr^  its  (  k  —  1  )  -dimensional 
volume-element. 
fp(t)  is  the  spherical  mean  of  order  p  of  the  function  f(x)  at  the  point  x. 

*  Published  in  these  Proceedings. 

f  Throughout  this  paper,  the  c's  stand  for  some  constants, 
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In  theorems  III  and  VII  of  Paper  I  (4)  we  proved  the  following: 
(a)  Iffp(t) 


It-^-co 

(b)  If  S?R(x) -> I  as  #->oo,  then  limfp(y)  -. 

p  >  max  (l,y  —  ^(k  —  3)). 

It  is  our  object  to  prove  that,  iffp(t)  is  of  bounded  variation  in  0  <  t  <oo, 
then  S*R(x)  is  of  bounded  variation  in  0  <  R  <  oo;  in  other  words,  the  multiple 
Fourier  series  is  absolutely  summable  (v2,  #),  or  summable  |  y2,  #|,  and 
conversely.  If  k  —  1 ,  summability  |  v2, 8  \  is  known  to  be  equivalent  to 
absolute  Cesaro  summability  |  O,  £  |  (2). 

We  prove  besides  that  Riesz's  summability  |  v2,  S  \  is  a  local  property  for 
multiple  Fourier  series  for  8>  %(k+  1),  thus  generalizing  a  known  result  on 
single  Fourier  series,  and  also  establish  a  connexion  between  summability 
|  v2, 8 1  and  summability  (y2,  d). 

2.  Let  J^t)  be  the  Bessel  function  of  order  /i\  V^t)  =  Jp(t)/V>.  Then  we 
have  the  following  formulae  (5): 

.cos(*  +  c),^/  M      as    ^TO>  (2ll) 


,     for 

LEMMA*  1.  If  p^  l,fp(t)  =  O(l),  as  £->oo. 
LEMMA  2.    If  p^  1,  then  fp(t)  =  ~  \cjp_^(t)  +  cjp_z(t)],  for 

For,  in  this  case,  fp(t)  =  c<f>p(t)/tk+2v-2 

is  absolutely  continuous  in  every  finite  interval  excluding  the  origin,  so 
that  almost  everywhere, 

w>~*&-'*&- 

i 
But 

Hence  fp(t)  — 

The  next  lemma  relates  to  a  sharpening  of  relation  (3-11)  of  Paper  I 
(of.  remarks  at  the  end  of  theorem  I  of  that  paper).  It  should  be  noted, 
however,  that  the  present  lemma  does  not  entirely  replace  the  previous 

*   (4),  (2-12). 
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theorem,  because  the  sharper  result  of  this  lemma  holds  only  under  an 
additional  restriction,  namely,  p  ^  1  .   Thus  we  have  : 

LEMMAS.    //  8>p  —  1  +  %(k—  1),  p^  1,  then 


r 

Jo 


(2-3) 


Proof..  Case  (i).  Suppose  p  ^  1  and  p  is  an  integer.  Then  this  lemma  is 
a  direct  consequence  of  theorem  I  of  the  previous  paper. 

Case  (ii).  Suppose  p^  I  and  p  is  not  an  integer.  Let  p  =  q  +  r,  where 
0  <  q  <  1  and  r  is  an  integer.  Then,  using  theorem  I  of  the  previous  paper, 
we  can  at  once  say 

SsK(x)  =  2ST(S+  1  )  Jiw*  r^WW^t-R)  dt,  (2-4) 

J  0 

provided  that  d>  \(lc—  1). 

We  now  integrate  the  right-hand  side  of  (2*4)  by  parts  r  times,  and  each 
time  use  the  relation 


(cf.  lemma  1)  and  obtain 

cS&x)  =  It*™  f 

Jo 

provided  that  S>q  +  r—  1  +  %(k  —  1  ),  which  is  the  required  formula. 

3-  1  .    THEOREM  I.    //  fp(t)  is  of  bounded  variation  in  0  <  t  <  oo, 
is  of  bounded  variation  in  0  <  Jf2  <  oo,  /or  #  >  p  4-  %(k  —  1  ). 
Pr00/.   Write 


(v)  dv. 
o 
Then,  if  8>p  +  $(k-  1), 


ru 

= 
Jo 


i/r(u)  =  O(l/w),     as     w^oo.  (3-11) 

Now  cfl&aO  =  ^  r^(tE)fp(t)  dt  =  f  "/p 

Jo  Jo 


Joo  /»<7 

<%>(*) 
o  Jo 


).  (3-12) 

o        o 
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The  last  step  is  permissible  since,  in  view  of  (3-11),  the  integral  is  absolutely 
convergent. 

From  (3-  12),  substituting  for  i/r,  it  follows  that 


d8»s\^  f  V+«-^   PV«%rtW(ta)  dfp(t) 

Jo  Jo 

<  r«*4*->  A,  fv*'  i  vp+tk+s(tu)  \\dfp\ 

Jo  Jo 

=|  "<*+*•  I  #„  1  J  V+2"-1  I  JU**<*)  I  dM> 

which  is  justified  since  8  >  p  +  £(&  —  1  )  . 

foe  /•!/«        p<» 

Let      J-      «*+a*-MW-f*N)ld<*  =       +       =  </i  +  <4     say. 

Jo  J  o       J  nt 


Then 


so  that  \ 

Using  this  in  (3-13)  we  obtain 


which  is  the  required  result. 

THEOREM  II.   //  S*R  is  of  bounded  variation  in  0  <  R  <oo,  ^en  /p(£)  i5  o/ 
bounded  variation  in  0<t<co  for  p  >  max  (  1  ,  8  —  \(k  —  3))  . 
Proof.   Using 


which  holds  for  p  >  1  and  S>  $(k—  1),  and  applying  the  same  argument  as 
in  theorem  I,  we  get  the  result. 

3-2.  By  theorem  I  it  follows  that  the  multiple  Fourier  series  is  absolutely 
summable  (v2,  8),  for  8  >p  -f  \(k  —  1),  whenever  fp(t)  is  of  bounded  variation, 
and  ipso  facto  summable  (v2,£);  in  fact,  we  can  state  something  more.  We 
can  lower  the  order  of  summability  to  8  >  p  —  1  +  %(k  —  I  ),  if  only  p>  I  . 
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THEOREM  III.  Iffp(t)  is  of  bounded  variation  in  0  <  t  <oo,  and  p^l,  then 

lim  S*R(x)  =  2* 


for 

Proof.  fp(t  -j-  0)  exists  for  every  t  >  0,  since  fp(t)  is  of  bounded  variation 
in  0<£<oo.  Also,  to  begin  with,  we  may  assume  tha,tfp(t)  is  bounded  and 
mono  tonic  and  fp(  +  0)  is  zero.  This  involves  no  loss  of  generality,  since  a 
function  of  bounded  variation  can  be  expressed  as  the  difference  of  two 
bounded  monotonic  functions. 

From  (2-3)  we  have,  for  p>l, 


for 

Let  /  =  cRk+**>[  P  4  r  +  TV  /i  +  /a  +  4»  (3-21) 

where  r)  is  such  that  |  fp(t)  \  <  e  for  1  1  \  ^  ?/  arid  any  arbitrary  e  >  0,  and  (t) 
is  sufficiently  large. 

Applying  the  second  mean-  value  theorem  to  Ilt  we  have 


where 

The  integral  on  the  right-hand  side  is  finite,  and  |  /p(?/  —  0)  |  <  e  so  that 

\Il\<c1e.  (3-22) 

Similarly 

^  -  «[/„(«  +  o)  rRz*+*»-%+tM(z)  dz  +/,(«>)  r  z^^-w^M(z)  dz~\  , 

L  J  w72  J  u>iR  J 

(3-23) 
where  <t)<o)l<co. 

We  know  that/p(6>  +  0)  and/p(oo)  are  finite,  and  that 


r  °°  k 
Jo*  +2p  1 


converges  for  $>p—  1  +  \(k—  1)  by  lemma  2,  so  that,  for  a  fixed  o)  the 
integrals  on  the  right-hand  side  of  (3-23)  can  be  made  less  than  an  arbitrary 
e  >  0  in  absolute  value,  for  R>R1.  Hence 

|/3|<c2e.  (3-24) 
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[ 
/' 


The  two  integrals  on  the  right-hand  side  can  be  made  less  than  any  arbitrary 
e  >  0  in  absolute  value,  provided  R  is  sufficiently  large.  Thus 


Also 


\I2rl\<c3e,     for 

|  /2,2  |  <  (jjr^i  Jj  /,(«)  I  q 


<C4€, 


(3-25) 
(3-26) 


for 

Combining  (3-25)  and  (3-26)  we  obtain  |  72  |  = 

3-3.  We  shall  now  prove  the  local  property  of  absolute  summability 
(va,  £)  for  multiple  Fourier  series. 

THEOBEM  IV.  Summability  \  p2,  d  \  ,  /or  £  >  J(fc  -f  1),  of  the  multiple  Fourier 
series  off(x)  at  any  point  depends  only  on  the  behaviour  of  the  function  in  the 
neighbourhood  of  that  point. 

Proof.  If  p  >  1,  using  the  reasoning  in  lemma  2,  we  obtain,  in  virtue  of 
relation  (3-12), 


=  \dR 


(3-31) 


=  \dR 


=  JdLR|Ji 


,     say. 
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Now,  ifp>  l,fp(t)  exists  for  almost  all  t,  so  that,  by  lemma  2,  we  have 


-1/1)<0  Vp+tk+s(tR)  dt 

=  4,1  +  4,2'     saJ>  (3-32) 

(1         \  f°°       dt 
jjzqpH+i)  J,  F^jW     by  Iemma  *> 

'     if    *>JP  +  «fc-l).  (3-33) 


If 


J, 


tMp~*\fp-*(t)\dt   r  ^,-1(0 

-  -  '  = 


by  lemma  1,  provided  d>p  +  ^(k—  1).  Therefore 


for 


and  so,  by  (3-32),  it  follows  that 


I 

J  y 


\J2\dJK<oo. 


(3-34) 


(3-35) 


From  (3-31)  and  (3-35)  it  follows  that  a  necessary  and  sufficient  condition 

|*00 

for  summability  |  v2,  S  |,  $>%(k—l),  is  that        \J1\dR<cot  which  proves 

J? 
the  theorem. 
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CANONICAL  FACTORIZATION  OF  PSEUDO  -UNITARY  MATRICES 

By  H.  C.  LEE 

[Received  2  September  1945  —  Read  16  November  1945] 

Introduction 
A  square  matrix  A  is  said  to  be  pseudo-unitary  if 


o  -/o  -/ 

(where  a  star  denotes  a  conjugate  transpose).  The  integers  p  and  q  will  be 
fixed  once  for  all.  When  A  is  real  it  is  called  pseudo-orthogonal.  Pseudo  - 
orthogonal  matrices  have  been  studied  by  many  writers.  f  Extending  Hsu's 
method  after  arriving  at  the  splitting  invariants  of  a  pseudo  -unitary  matrix 
(§1),  we  shall  obtain  in  a  very  short  way  a  preliminary  factorization  for 
such  a  matrix  (§2),  a  result  similar  to  one  of  Au  tonne's.  Introducing  the 
nucleus  (§3)  we  then  elaborate  a  canonical  factorization  (§4)  which  is 
unique  when  certain  submatrices  are  non-singular  (§5).  The  paper  ends 
with  an  application  to  the  theory  of  plane  factorization  (§  6). 

1.    The  splitting  invariants 
Split  A  in  the  block  form 


where  Al  is  p  x  p  and  A^  is  q  x  q.  By  (1)  we  have 
AI  AI  =  lp  -\-  AS  A$,     A^A^  —  lq  +  AzA^, 
The  first  two  relations  imply  that  A^  and  A±  are  non-singular,  since 


f  A.  Brill,  Das  Rdativitdtsprinzip,  Teubiier  (1912),  31.  L.  Autonne,  Ann.  Univ. 
Lyon,  38  (1915),  1-70.  H.  C.  Leo,  "On  plane  factorizations  of  pseudo -Euclidean 
rotations",  Quart.  J.  of  Math.  (Oxford  series),  15  (1944),  7-10.  P.  L.  Hsu,  "On  a 
factorization  of  pseudo -orthogonal  matrices"  (to  appear  elsewhere).  I  am  indebted 
to  Dr  Hsu  for  communicating  to  me  his  paper  in  manuscript  form. 
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Ip  +  A$AB,  for  example,  is  the  matrix  of  a  positive  definite  Hermitian  form; 
whence  the  third  relation  may  be  written 

A2  =  MA4>     A3  =  M*Aly  (3) 

where  M  is  a,p  x  q  matrix.  Substituting  these  in  the  first  two  relations  we  get 

We  have  now  to  make  use  of 

LEMMA  1 .  M  being  any  pxq  matrix  of  rank  r,  there  exist  a  unitary  matrix 
U  of  order  p  and  a  unitary  matrix  V  of  order  q  such  that 

U*MV  =  (A     °] , 

where  A  is  a  diagonal  matrix  with  r  diagonal  elements  which  are  the  square 
roots  of  the  non-zero  (and  so  positive)  characteristic  values  of  the  positive 
semi-definite  hermitian  matrix  M*M  (of  order  q,  rank  r).~\ 

Now  the  non-zero  characteristic  values  of  M*M  being  necessarily  less 
than  1  by  (4),  we  may  write  them  in  the  form  tanh2#,£  (i  —  1,  ...,r),  where 
the  0's  are  real  and  positive.  Lemma  1  then  gives 


We  find  by  (4) 


0\ 


A  A*  ~  F/cosh2  0i  +  •  • •  +  cosh2  °r       0  W* 

A*A*  ~  V(  o  iqj  v  ' 

We  observe  immediately  from  (6)  that  we  have 

THEOREM  1.  The  characteristic  values  of  the  hermitian  matrices  AlAf  and 
A^A^  (of  orders  p  and  q  respectively)  are  all  >  1,  and  those  of  these  values 
which  are  >  ] ,  namely,  cosh2  01}  . . . ,  cosh2  6r,  are  the  same  for  the  two  matrices. 

The  numbers  0lt  ...,0r,  as  well  as  the  integer  r,  which  were  first  defined 
by  means  of  the  auxiliary  matrix  J/,  receive  now  an  invariant  meaning, 
being  intrinsically  connected  with  the  matrix  A  in  the  manner  described 
in  theorem  1 .  We  therefore  introduce  the 

DEFINITION.    The  real  and  positive  numbers  0^...,0r  introduced  in 
theorem  1  are  called  the  splitting  invariants  of  the  pseudo -unitary  matrix  A. 
In  particular,  the  integer  r  is  a  numerical  invariant  of  A ,  and 

)•  (7) 


t  This  is  an  extension  of  a  known  result;  see  C.  C.  MacDuffee,  The  theory  of  matrices 
(1933),  78.  We  defer  the  proof  to  Appendix  1. 
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2.    Preliminary  factorization 

Returning  now  to  (6)  we  easily  see  that  A±  and  A4  are  of  the  form 
A    _  r7 /cosh  #!+...  4- cosh  0r       0  \ 

A*-u\  o  IPJ*> 

v  /cosh  6 1 4- . . .  4-  cosh  6r       0  \  v 

4=    I  o  /ajr>, 

where  X  and  F  are  unitary  matrices  of  orders  p  and  q  respectively.  By 
(3)wefindthat     ^    ^  ^ /sinh  ^  4. . . .  +  sinh  ^ 

_      /sinhtfi  :        ?--«i./>     n\        r  W 

Hence  (2)  may  be  written 

IT       u       fs       u  \ 
\   I  ft      T          o       n    i  /v     n\ 

(10) 


OV 

F; 


where          T  —  cosh  0L  +  .  .  .  4-  cosh  0r,     $  =  sinh  ^  4-  ...  4-  sinh  6r,  (11) 

and  we  have 

THEOREM  2.  A  factorization  of  the  pseudo-unitary  matrix  A  ,  whose  splitting 
invariants  are  #1}  ...,#r,  is  given  by  (10),  where  T  and  S  are  defined  by  (11) 
and  U,  X  are  unitary  matrices  of  order  p,  and  V,  Y  are  unitary  matrices  of 
order  g.f 

3.     The  nucleus 

We  may  push  on  to  a  deeper  analysis  of  the  factorization.  Let  0lf  ...,  6a 
be  the  distinct  members  among  the  splitting  invariants  (thus  s  ^  r)  and  let 
their  multiplicities  be  rl5  ..  .,  rs  respectively,  so  that 

r1+...+rs  =  r.  (12) 

Then  we  may  write  (11)  in  the  form 

T  =  cosh  0^4-  ...4-cosh087rs,     S  =  sinh01/f.1+  ...isinhtf^.     (13) 

Though  really  unnecessary,  we  shall  for  aesthetic  reasons  transform  the 
middle  matrix  on  the  right  of  (10)  by  interchanging  the  first  two  (block) 
rows  and  the  first  two  (block)  columns,  thus  obtaining  a  matrix  of  Autonne's 

form  00       0 

T     8        °       .  (14) 

0       8     T      0  V     ' 

0        00     Iq_ 
Cf.  C.  C.  MacDuffee,  loc.  cit.,  68. 


1945]  ON  PSEUDO-UNITARY  MATRICES  233 

In  doing  so  the  form  of  (10)  is  preserved,  since  only  the  columns  of  U  and 
the  rows  of  X  are  permuted. 

DEFINITION.  The  real  matrix  N  in  (14),  where  T  and  S  are  denned  by 
(13)  and  where  Oly...,0a  are  the  distinct  splitting  invariants  of  A  in  increasing 
order  and  with  respective  multiplicities  rl9  .  .  .,  rs,  is  called  the  nucleus  of  A. 

The  nucleus  N,  being  invariantly  connected  with  A,  is  evidently  uniquely 
determined.  It  is  (like  ^4)  a  pseudo-unitary  matrix,  satisfying  (1),  as  one 
can  see  at  once  by  direct  verification.  Since  N  is  real  it  is  also  a  pseudo- 
orthogonal  matrix,  of  a  special  form. 

Let  us  now  inquire  how  (to  what  extent)  N  is  unaltered  when  pre- 
multiplied  by  a  unitary  matrix  L  and  post-multiplied  by  a  unitary  matrix  R* 

LNR*  =  N.  (15) 

When  this  is  so,  the  right-hand  side  of  (10)  may  be  written 

We  prescribe  the  choice  of  L  so  that  when  it  multiplies  into  the  first  factor 
of  (16)  it  does  not  disturb  the  simplicity  of  that  factor  as  a  direct  sum,  and 
likewise  choose  R*  so  that  its  product  with  the  last  factor  of  (16)  is  a 
similar  direct  sum.  This  is  equivalent  to  supposing  that  L  and  R  are  them- 
selves similar  direct  sums  (i.e.  the  first  direct  summand  is  pxp  and  the 
second  qxq).  Therefore  we  write 


\  /JSn     7?12\  . 

"    *=Ut    lJ  + 


where,  for  L,  £n  is  (p-r)x(p-r),  7>22  isrxr,  L33  is  r  x  r,  L44  is  (q  —  r)  x  (q  —  r), 
and  similarly  for  R. 

Now  write  (15)  in  the  form 

LN  =  NR.  (18) 

By  (14)  and  (17),  (18)  may  be  written 

£12      0  0 
L2l     L22      0        0 

0        0  L33  L34 

0        0  L43 
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whence,  multiplying  and  equating,  we  have 

£ii  =  #ii>  ^12  ^  =  #12,  L12S=Q,  0  =  0, 

L2l  =  TR2l,  L22T  =  TR22,  L22S  =  &R33,        0  =  SRS±, 

0  =  SR2l,  L33S  =  SR22)  L^T  =  TR^  £34  =  TR^, 

0  =  0,  £43  #  =  0,  £43  T  =  £43,  £44  =  Ru. 

Since  $  is  non-singular,  four  of  these  relations  imply  at  once  LL2  —  0, 
7?34  =  0,  R2l  =  0,  £43  =  0;  then  another  four  give  R12  =  0,  L21  =  0,  Z/34  =  0, 
7?43  =  0;  thus  L  and  -K  reduce  to  block  diagonal  form 

L  =  £n  +  L23  +  L33  +  £44,     ^  =  ^  +  ^22  +  ^33  +  ^44.  (19) 


Since  T  is  also  non-singular,  the  remaining  relations  can  be  solved  for  the 
R's  in  terms  of  the  .L's; 

_  |J?22  =  T-^IT  =  ^f-1/^^ 

^11  ~  ^11,        ^^  =    fl-l^fl    =   ^-1^^  T?         ^44  =   ^44-  (20) 

Then  X33  =  ST^L^T8~l  =  TJS^L^ST-1. 


Since  T  and  /S  being  diagonal  are  permutable,  we  have  from  (21)  GL22  —  L22  G, 
where  Q  ^ 

If  therefore  we  write 


11    •••    ^ 


22,  1« 

; 

22,sl   •••  ^22,  st 

where  L22  u  is  rixri  (i  —  1  ,  .  .  .  ,  s)  ,  on  account  of  the  distinctness  of 
^!,  ...,tanh2^s,  the  equation  GL22  =  L22G  implies  immediately  that 
—  0  (i^k',  i,k  =  1,  .  .  .,  s),  so  that  L22  has  block  diagonal  form 

L22==A+---+4J  (22) 

where  for  brevity  we  write  L-t  for  L22ii  which  is  rt  x  ri  (i  =  1,  ...,«s).  Then 
(21)  becomes  Z/3S  =  L22,  and  (20)  gives  Rpp  -=  />^  (p  =  1,  2,  3,  4).  Thus,  by 
(19),  we  have 


B  =  L.  (23) 

We  state  the  above  result  in  the  following 

THEOREM  3.  //  L  and  R  are  unitary  matrices,  of  the  form  (17),  which  verify 
the  relation  (18),  then  they  are  of  the  form  (23),  where  Z>n,  L^  (i  =  1,  ...,5)  and 
JL44  are  unitary  matrices  (of  orders  already  indicated). 
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4.    Canonical  factorization 

A  further  factorization  can  now  be  elaborated  from  (16)  by  properly 
choosing  the  matrices  £n,  Lt  (i  =  1,  ...,5)  and  £44  which  appear  in  the 
constitution  of  L  and  R  in  the  manner  of  (23).  In  (16),  the  matrix  22*, 
which  equals  L*  by  (23),  may  be  absorbed  into  the  last  factor  resulting  in 
a  factor  of  the  same  form  (with  X  and  Y  changed),  while  the  matrix  L 
multiplies  the  first  factor  changing  U,  V  into 

V(LU      0\          /£*       0\ 

U  (  0      Lj'      V\0        Lj-  W 

To  perform  this  reduction  in  a  more  specific  fashion,  we  need 

LEMMA  2.  Every  square,  matrix  can  be  post-multiplied  by  a  unitary  matrix 
so  that  the  product  is  left-triangular  (i.e.  having  zero  elements  on  the  right  of 
the  principal  diagonal)  and  has  real  and  non-negative  diagonal  elements.^ 

Then  write 


m  KA  f/4,n  ...  iu      F_/K  v,\         (26) 

\u»   uj-\ut      •  i  /'    K     U  V'         (   } 


where  t/j  is  (#>  —  r)  x  (p  —  r),  U±  is  r  xr,  U^u  is  r€  x  yf  (i  =  1,  .  .  .,  ,9),  Vl  is  r  x  r, 
and  T£  is  (q  —  r)x  (q  —  r).  The  products  in  (24)  can  now  be  written 


We  may  then  choose  the  unitary  matrices  LIL,  Lt  (i 
accordance  with  lemma  2  such  that  the  products  U^  Zrll3  C/4  ti  L^  (i  —  1 , . . .,  s) 
and  V±L^  are  left-triangular  and  have  real  and  non-negative  diagonal 
elements.  We  suppose  this  reduction  already  effected,  i.e.  in  (25)  the  square 
blocks  Ult  U4iii  (i  =  !,...,#)  and  V^  are  left-triangular  matrices  having  real 
and  non-negative  diagonal  elements.  Hence 

THEOREM  4.   A  canonical  factorization  of  the  pseudo-unitary  matrix  A, 
whose  nucleus  is  N,  is  given  by 


f  We  again  defer  the  proof  to  Appendix  2.  Cf.  also  H.  W.  Turnbull  and  A.  C.  Aitken, 
An  introduction  to  the  theory  of  canonical  matrices  ( 1932),  96  (Schmidt's  theorem). 
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where  U,  F,  X,  Y  are  unitary  matrices  (of  orders  p,  g,  p,  q  respectively),  and 
if  we  write  U,  V  in  the  form  (25)  the  blocks 

&i>     U*,u     (*  =  1,  ...,*),    F«  (27) 

are  left-triangular  matrices  having  real  and  non-negative  diagonal  elements. 

To  count  the  number  of  real  parameters  in  the  canonical  factorization 
(26),  note  that  a  general  unitary  matrix  of  order  n  depends  on  ri2  real  para- 
meters. Thus  X  and  Y  together  depend  on  #2  +  #2  real  parameters.  If  U 
and  V  were  general,  they  would  also  involve  p*  +  q2  real  parameters;  but 
we  have  used  jLn,  Lt  (i=l,...,s)  and  LM,  which  together  contain 


(p—rY  +  S^-Hfe  —  r)2  real  parameters,  to  remove  the  same  number  of 

i-i 

real  parameters  in  Ult  U^u  (i  =  1,  ...,«)  and  ^.   Finally,  the  nucleus  N 
contains  s  real  parameters  (namely,  019  ...,0S).  Hence 

COROLLARY  1.  The  canonical  factorization  (26)  depends  on 


l)  (28) 

i=l 

real  parameters. 

In  the  extreme  case  when  r  —  min(p,  q)  and  the  splitting  invariants  are 
all  distinct,  s  —  r,  ri  —  1  (i  =  1,  ...,s),  the  number  (28)  attains  its  maximum 
value  (p  +  #)2  which  is  the  number  of  real  parameters  in  a  general  pseudo- 
unitary  matrix  (of  order  p  -f  <?)• 

While  formal  results  in  the  complex  case  are  valid  also  in  the  real  case, 
the  counting  of  parameters  is  very  different  since  in  the  real  case  an  ortho- 
gonal matrix  of  order  n  depends  on  %n(n—  1)  parameters  only.  Hence, 
instead  of  corollary  1  ,  we  now  have 

COROLLARY  2.  In  the  real  case  the  canonical  factorization  (26)  depends  on 


-l)  (29) 

parameters. 

In  the  extreme  case  mentioned  above,  the  number  (29)  reaches  its  maxi- 
mum %(p  +  q)  (p  +  q—  1)  which  is  the  number  of  parameters  in  a  general 
pseudo-orthogonal  matrix  (of  order  p  -f  q). 
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5.    Uniqueness 
We  wish  to  prove 

THEOREM  5.   The  canonical  factorization  (26)  in  theorem  4  is  unique  when 
the  triangular  blocks  (27)  are  non-singular. 

We  have  seen  that  the  nucleus  -ZV"  is  unique.  Let 

IV     0\        X     0\         C     0(X     0\ 


where  ff,  F,  X,  T  are  subject  to  the  same  conditions  as  U,  V,  X,  Y  in 
theorem  4.  Then  we  have  a  relation  of  the  form  (18),  where  now 

0      0\*/U      0\       /V*U         0 

?Ho    v)  =  (  o       F* 

0\  /JC      0\*       (XX*         0 

fjlo    FJ  ={  o      fr 

The  uniqueness  will  be  proved  if  we  can  show  that  both  L  and  R  here  reduce 
to  the  identity  matrix. 

Since  by  (30)  L  and  R  are  unitary  matrices  of  the  form  (17),  theorem  3 
implies  that  they  have  the  form  (23).  Comparing  (30)  with  (17)  we  obtain 


F, 

We  have  in  particular       U^  =  t\  £n,     ^4  =  "t^L^,  (31) 

and  also  C74  =  C^  L22  which  is  further  analysed  into 


tu*,u  ...  u^\ 
W«.rt  -  ^,J 


V 

which  gives  in  particular 

^Mi=^MiA      (»==!,...,*).  (32) 

Remember  that  in  (31)  and  (32)  the  blocks  Lu,  Lt  (i  =  !,...,<$)  and  I/44  are 
unitary  matrices,  and  the  other  blocks  are  just  the  left-triangular  matrices 
which  we  assume  to  be  non-singular  and  which  therefore  have  real  and 
positive  diagonal  elements.  At  this  point  we  need 

LEMMA  3.   Let  W  and  W  be  two  left-triangular  matrices  of  equal  order, 
each  having  real  and  positive  diagonal  elements.    If  K  is  a  unitary  matrix 
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of  the  same  order  such  that  W  —  WK,  then  K  can  only  be  the  identity 
matrix.^ 

Applying  this  lemma  to  (31)  and  (32)  we  see  that  Ln,  Lt  (i  —  !,...,«) 
and  LU  all  reduce  to  identity  matrices.  So  do  L  and  R  by  (23).  This  com- 
pletes the  proof. 

6.    Application  to  the  theory  of  plane  factorization 

In  the  real  case  there  is  a  worked  theory  of  plane  factorization  which 
asserts  that  every  orthogonal  matrix  of  order  n,  determinant  4-  1,  is  factorable 
into  a  product  of  \n(n  —  1)  plane  rotations,  $  and  that  every  pseudo-orthogonal 
matrix  A  of  type  (1),  determinant  +  1,  is  factorable  into  a  product  of 

%(p  +  q)(p  +  q-l)-mm(p,q)  (33) 

plane  rotations  and  mm  (p,  q)  (34) 

pseudo-plane  rotations.^  However,  the  factors  can  be  reduced  in  number 
according  to  the  structure  of  the  given  orthogonal  or  pseudo  -orthogonal 
matrix,  this  being  clear  from  the  method  of  factorization  ||  shortened  by  the 
theorem  in  Appendix  2.  Now,  we  propose  to  see  how  much  the  numbers 
of  factors  can  be  reduced  for  the  pseudo  -orthogonal  matrix  A  basing  on  the 
canonical  factorization  (26). 

The  matrices  U,  F,  X  ,  Y  in  (26)  are  in  the  present  case  orthogonal  matrices 
and  have  therefore  determinants  ±  1  .  We  wish  first  to  point  out  that  these 
determinants  can  be  made  all  equal  to  -\-  1  ,  possibly  at  the  expense  of  changing 
the  sign  of  at  most  two  rows  and  two  columns  of  the  nucleus  N  on  the  right 
of  (26).  To  see  this  let  /^^  stand  for  the  matrix  obtained  from  In  by  replacing 
its  first  diagonal  element  1  by  ±1,  thus 


If  we  denote 

det  U  =  u  =  ±  1,     detZ  =  x  =  ±  1,     det  F  =  v  =  ±  1,     det  7  =  y  =  ±  1, 
we  may  write  (26)  in  the  form 

m    o\  (iy    o  \  //£>     o  \    //£>     o  \  //<?     o  \  /z    o\ 

^~lo      F/\0      n*)\o       1$>)*\Q       /fMo       If)\0      Y] 


t  For  proof  see  Appendix  3. 

J  F.  D.  Murnaghan,  The  theory  of  group  representations  (1938),  320. 
§  H.  C.  Lee,  "On  plane  factorizations  of  pseudo  -Euclidean  rotations",  Quart.  J. 
of  Math.  (Oxford  series),  15  (1944),  8. 

I!  F.  D.  Murnaghan,  loc.  cit.  319-320;  Lee,  loc.  cit.  8-10. 
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where  U,-UPj*,    V0  =  Vlf,     X0  =  f?X,    T0  =  I^Y,  (36) 


It  is  clear  from  (36)  that  U0,  VQ>  X0,  YQ  are  orthogonal  matrices  of  determinant 
-f  1,  and  from  (37)  that  N0  is  obtained  from  N  by  at  most  changing  the  sign 
of  the  first  and  the  (p  +  l)th  rows  and  the  first  and  the  (p  +  l)th  columns. 

N0,  like  N,  is  also  a  special  pseudo  -orthogonal  matrix.  By  (35)  we  have 
det-4  =  detN0.  A  as  a  pseudo-orthogonal  matrix  has  determinant  ±1. 
We  shall  suppose  det^4  =  +  1,  thus  also  det^  =  +  1. 

What  is  essential  for  us  for  the  present  is  only  the  fact  that  N0,  like  N, 
has  only  r  non-zero  elements  on  the  right  of  the  principal  diagonal,  and 
these  elements  are  all  found  after  the  ^>th  column.  Hence,  the  method  of 
factorization  (shortened  by  the  theorem  of  Appendix  2)  tells  us  that  N0 
is  a  product  of  exactly  r  pseudo-plane  rotations. 

Moreover,  if  we  write  (35)  in  the  form 


0\       p0      (Aft,      0\ 

F0r°U  /JU  yj-          (38> 


F0 

and  take  account  of  the  special  shapes  of  £70  and  VQ  (see  (36)  and  (25)  and 
remember  the  triangular  blocks),  the  same  shortened  method  informs  us 
that  the  number  of  plane  rotations  in  factorization  is  at  most 

lp(p-l)-l(p-r)(p-r-l}-^ri(ri-l)     for     U0, 

*"1  for    F0, 

for     X0, 


These  are  respectively  the  numbers  of  factors  when  the  four  matrices  of 
(38)  besides  N0  are  factorized.  Hence 

THEOREM  6.  The  pseudo-orthogonal  matrix  A  of  determinant  unity,  whose 
invariant  multiplicities  are  rl9  ...,rs  (rt+  ...  4-rs  =  r),  can  be  factorized  into 
a  product  of  at  most 


i)-(p-r)(q-r)-  SW^+i)  (39> 

i=l 

plane  rotations  and  r  (40) 

pseudo-plane  rotations. 
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The  two  integers  (39)  and  (40)  do  not  exceed  the  two  integers  (33)  and 
(34)  respectively.  Theorem  6  is  therefore  an  invariantive  refinement  of  the 
known  result.  In  the  extreme  case  when  r  =  min  (p,  q)  and  the  splitting 
invariants  are  all  distinct,  s  —  r,  ri  —  1  (i  —  1,  ...,#),  the  two  numbers  (39) 
and  (40)  reach  their  maxima  (33)  and  (34)  respectively. 

Finally,  we  add  that  a  parallel  theory  can  also  be  developed  for  the 
complex  case. 

Appendix  I 

Proof  of  lemma  I .  There  exists  a  unitary  matrix  V  of  order  q  such  that 


V*M*MV  = 


0\ 

o/- 


If  <I>  and  T  denote  the  matrices  formed  by  the  first  r  and  last  q  —  r  columns 
of  V  respectively:  V  =  (O5T),  then 

V*M*MV  =  ( 


_ 
P*  *  )  ~ 

whence 

®*M  *Jf<D  =  A2,    ®*M  *M*¥  =  0,     ¥*#  *JfO  =  0,     Y*Jf  *APF  =  0. 
Of  these  four  relations  the  last  may  be  written  (M  *F)*  (M*¥)  —  0,  implying 

JfT  =  0     (and  so  also  T*Jf*  =  0); 

the  middle  two  are  then  automatically  satisfied,  and  the  first  may  be  written 
Q*a  =  /r,     where    O  = 


Thus  n  is  a  p  x  r  matrix  whose  r  columns  are  unitary-orthogonal  to  one 
another.  Let  U  be  a  unitary  .matrix  of  order  p  whose  first  r  columns  are 
just  those  of  Q:  U  =  (Q,  T).  Then 


U+M  V  =  Jf  (<D,  T)  =  (J/<£,  0) 

=  /a*jifo   o\  _  /H*QA   o\  _  /A   o\ 
~  \r*Jtfo   o  j  "  \r*aA    o/  ~  \o    o;  ' 

vanishes  by  the  unitary  property.  Lemma  1  is  proved. 


Appendix  2 

For  simplicity  here  and  for  applications  to  §  6  we  first  confine  ourselves 
to  the  real  case.  We  call  an  n  x  n  orthogonal  matrix  P  of  determinant  +  1 
a  plane  rotation  if  the  corresponding  linear  transformation  affects  only 
two  variables. 
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THEOREM.  Let  Z  be  a  real  square  matrix  of  order  n  having  N  non-zero 
elements  on  the  right  of  its  principal  diagonal.  There  exist  N  plane  rotations 
P1?  ...,-FJv  such  that  the  product  ZP^...PN  has  exclusively  zero  elements  on 
the  right  of  its  principal  diagonal. 

The  proof  of  this  theorem  is  essentially  given  by  Murnaghan.f  The  non- 
zero elements  of  Z  on  the  right  of  its  principal  diagonal  are  annulled  succes- 
sively by  properly  choosing  the  P's. 

Proof  of  lemma  2.  The  above  theorem  can  evidently  be  carried  over  to 
the  complex  case,  with  the  result  that  Z  (now  complex)  can  be  post-multi- 
plied by  a  unitary  matrix  so  that  the  product  is  left-triangular.  If  the 
diagonal  elements  of  this  product  are  written  in  the  usual  polar  form 
/Ojg^i,  ...,/one**»,  a  further  post-multiplication  by  the  diagonal  unitary 
matrix  e~i(f>i  +  . . .  4-  e~tcp«  makes  the  diagonal  elements  equal  to  plt  . . . ,  pn 
which  are  real  and  non-negative.  Lemma  2  is  proved. 

Appendix  3 

Proof  of  lemma  3.  Since  W  is  non -singular,  W~*  exists  and  is  likewise  a 
left-triangular  matrix  having  real  and  positive  diagonal  elements.  The 
product  K  =  J?^"1}^  of  two  such  matrices  is  again  such  a  matrix,  and  such 
a  matrix  if  unitary  must  be  the  identity  matrix.  Lemma  3  is  proved. 
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t  F.  D.  Murnaghan,  loc.  cit.  319. 
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AN  EXTENSION  OF  A  TAUBERIAN  THEOREM  OF 
L.  J.  MORDELL 

By  S.  MINAKSHISTTNDARAM  and  C.  T.  RAJAGOPAL 
[Received  3  December  1944 — Read  25  January  1946] 

1.  This  paper  seeks  to  extend  a  Tauberian  theorem  of  L.  J.  Mordell 
involving  the  first-order  Cesaro  mean  (9)*  by  substituting  for  the  Cesaro 
mean  a  Riesz  mean  of  any  order  r  >  0.  Three  points  regarding  the  extension 
may  be  mentioned  at  the  outset,  (i)  The  particular  case  r  =  I  of  the  exten- 
sion seems  to  be  superior  to  a  similar  extension  by  N.  Higaki  (5,  theorem  IV), 
in  that  it  serves  to  amplify  the  familiar  Hardy-Landau  convergence  theorem 
for  series  summable  by  the  first  Riesz  mean  (6,  3 If.),  (ii)  The  general  case 
r  >  0  of  our  extension  shows  how  one  of  K.  Ananda  Rau's  theorems  on  series 
(1,  theorem  4  and  8,  theorem  1;  also  2,  theorem  2)  may  be  restated  with  a 
one-sided  restriction  on  the  terms  of  the  series  instead  of  his  two-sided 
restriction,  (iii)  The  case  r  >  I  of  the  extension  is  proved  by  a  method  which 
H.  D.  Kloosterman  (7)  suggests  for  obtaining  a  generalization  of  Mordeirs 
theorem  with  a  Cesaro  mean  of  any  integral  order  r  >  I  .f 

00 

1-1.   Notation.  S  aw  is  a  real  series  with  partial  sums 

n=0 

*»=  S«,     fa  =  0,1,  2,...). 

v=0 

{AjJ  is  a  sequence  such  that 

0<A0<A1< ...  <An->oo     (?i->oo). 


*  Arabic  numerals  in  thick  type  enclosed  within  brackets  indicate  references  given 
at  the  end  of  the  paper.  We  are  indebted  to  the  referee  for  drawing  our  attention  to 
the  paper  of  Bosanquet  (2)  and  Doetsch  (3). 

t  Perhaps  the  chief  interest  of  the  proof  lies  in  the  fact  that  it  preserves  Mordeirs 
ideas  in  Kloosterman's  form.  Every  other  consideration  is  in  favour  of  the  neat 
alternative  proof,  based  on  some  identities  of  Bosanquet  (2),  which  has  been  kindly 
suggested  to  us  by  the  referee.  This  proof,  outlined  in  a  subsequent  footnote,  dispenses 
with  the  need  for  separate  analyses  in  the  cases  Q^r<l  and  r  >  1. 
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o*(<i))  is  the  rth  Riesz  mean  of  {sn}  with  respect  to  {An},  defined  by 


where  we  have  written  A(t)  =  A°(t)  =  sn  (An  ^  t  <  An+1). 
For  any  function  F(x)  of  a;  and  9  >  0  we  set 


=  S  (  -  1)'        F(x  -  »<?). 

v==o  V/ 

2.   THEOREM  1.   Hypotheses: 

(i)  <j>(x),  *IT(X)  are  positive  functions  ofx>0  such  that 

6(x)  =  {$(x)W(x)yKr+V  ^  x     (r  >  0).  (2-1) 

(ii)  <j>(x)  is  non-decreasing  \  there  are  constants  h,  H  such  that 

(2'2) 
(2'3) 


for  \  x'  —  x  \  ^  rj&(x),  y<\  being  a  given  constant. 

(iii)  ^-A^^AJ.  (2-4) 

(iv)  oJ(A.  -  A^)  <  Kir  (An),  (2-5) 

(v)  |4«-(w)|<<50(w),  (2-6) 

where  K,  d  are  positive  constants. 
Conclusion'.  If&(x)  is  defined  by 


then  A(v)  =  O[&(a>)]     (w->oo).  (2-7) 

2  •  1  .    $0we  lemmas.  The  proof  of  theorem  1  requires  the  following  lemmas, 
of  which  the  first  is  taken  from  Hardy  and  Riesz  (4,  lemma  8): 

LEMMA  1  ,  IfQ^£^a)  and  p  denotes  the  greatest  integer  less  than  r  >  0,  then 


f  (a)-t 
Jo 


Q2 


244  S.  MINAKSHISUNDARAM  and  C.  T.  RAJAGOPAL  [25  Jan. 

LEMMA  2.  Under  the  hypotheses  (2-1),  (2-2),  (2-3),  (2-4)  and  (2-5)  oftheoreml, 
A(<i>)-A(t)<KH(e  +  ii)&(Am)  (2-11) 

provided  that       Am  -  eO(  Am)  <  t  <  Am  <  w  <  Am+1     (0  <  e  <  ??)  . 

Proof.   Let  A3^Am-e0(AJ<Aa+1     (?<™),1 

V  (2-12) 

Av<^<Ay+1     (<7+l<v+l<ra)J 

Then,  if  v^q+  1,  we  have,  using  in  succession  (2-5),  (2*3),  (2*12), 
A(u)-A(t)  =  sm-sv  =  av+1  +  av+z+...+am 


)  <  KH(Xm  -  A 

(2-  11  a) 

Next,  if  y  =  g,  using  successively  (2-1  la),  (2-5),  (2-4),  (2-2),  (2-3),  we  get 
A(w)-A  (t)  =  sm  -  sQ+1  +  atf+1 


since,  on  account  of  (2-2)  and  (2-3),  0(Ag+1)  <  H^(\m)  and  ^(AQ+1)  ^  H\Ir(Xm) 
which  taken  together  give  #(Ag+1)  <  H$(Xm). 

(2-  11  a)  and  (2-116)  together  establish  (2-11). 

It  may  be  remarked  that  (2-  1  1  )  is  a  fortiori  true  when  the  proviso  attached 
to  it  is  replaced  by 


LEMMA  3.  Under  the  hypotheses  (2-1),  (2-3)  and  (2-5), 

A(t)-A(a>)^KHe$(Xm)  (2-13) 

provided  that  Xm^ito<t^iXm-\-  e6(Xm)     (0< 


Proof.  Suppose  (as  we  may  without  loss  of  generality)  that  o)  <  Aw+1. 
Suppose  further  that 

A       e^A<A          m 
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Then,  if  m  +  I  ^v^q,  using  successively  (2-5),  (2-3),  (2-14),  we  find 


)  +  .  .  .  +  (A,  - 
<  KH(XV  - 


Since  this  relation  is  trivial  for  y  =  m,  the  proof  of  (2-  13)  is  complete. 
LEMMA  4. 

(X    F(x-t)^0(t)dt  =  ^r    F(x-t)G(t)dt,  (2-16) 

J  X—  qp  J  OJ-9 

fX+V(a?  +  9-OAjG(*)(ft  =  A^r+V(^  +  9-0^)^          (2-16) 

J  X  J  X 

where  F(t),  O(t)  are  integrable  functions  for  the  values  of  t  in  question. 
The  lemma  is  easily  proved  by  induction. 

LEMMA  5.  Under  the  hypotheses  (2-1),  (2-2),  (2-3),  (2-4)  and  (2-5), 

A1+M*(*)  <  I>+  I)KH(e  +  v)  <p"#(Aw),  (2-17) 

provided  that  Am  —  e^(A7n)  ^a;--(i>+l)9<#^Aw; 

am2  /w^Aer  AJ+1  ^^(a;)  <  r(v  H-  1  )  KHeyft^)  ,  (2-18) 

provided  that 


Proof.    (2-17)  and  (2-18)  follow  by  induction  from  (2-11)  and  (2-13) 
respectively,  since 


rx  r(v±  i]  cx 

(x)  =  A'_,  v        A"~\t)  dt  =  i^J         A1^-M<)  *, 

Ja:--<p  A  V^^      JJC-<P 

rx+<p  Fft'-i-  I'i  f^+v 

(X)  =  A^      4-HO  «a  =  -W^-2      A^ 

Jx  L\v)     J  x 

LEMMA  6.  For  any  positive  integer  pt 

+  l)<?*>-1  r    A(t)dt 

Jx-9 

Jx  ^>- 

(*-*  +  ?)  S 
1-9  »"= 


where  the  second  expression  on  the  right  side  is  absent  if  p  =  1. 
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Proof. 

A*>~l(t)dt 


(2-19) 
The  lemma  is  now  established  by  using  (2-19)  as  a  reduction  formula. 

LEMMA  7.  ^or  any  positive  integer  p> 

/•aH-9 


/• 
J 

J#+<p  ^~2 

(*  +  9  -  *)  S 
x  ^=o 


e  second  member  of  the  right  side  being  absent  when  p  —  1. 
The  proof  is  similar  to  that  of  lemma  6. 

2-2.    Proof  of  theorem  Ifor  0<r<l.  First  we  prove  that  A(v) 
s  6>-»-oo. 

and  9  =  e#(AOT),  we  have 


AOT-<p 

=  JiH-J2     (say).  (2-21) 

From  lemma  1  with  p  =  0,  (2-6),  and  the  fact  that  <j)(x)  is  non-decreasing, 
we  get 

K|<M  max 

r 


(2-22) 
By  lemma  2,  Jt<KH(e  +  ^&(Am).  (2-23) 


1945]  A  TAUBERIAN  THEOREM  OF  L.  J.  MORDELL  247 

Employing  (2-22),  (2-23)  in  (2«21)  we  get 


(2-24) 


Now  the  restriction  on  a)  in  conjunction  with  (2*4)  gives 

Am+i  ~  ^('Wl)  <  Am  <  «><  Am+l, 

and  so,  on  account  of  (2-3), 


This,  taken  along  with  the  fact  that  ^(Am)  <^(o>),  gives 


which,  employed  in  (2-24),  ensures  that 

2     +  jr(e  +  *)0M.  (2-25) 


After  this  it  is  sufficient  to  prove  that  A(v)  =  OL[$(a))'].  If,  as  before, 
/^  ^  a)  <  Am+1  and  9  =  e(9(Aw)5  we  have 


=  /3-^     (say), 
where,  in  virtue  of  lemma  1  with^  =  0,  (2-6)  and  (2-2), 


(2-27) 
Also,  on  account  of  lemma  3, 

4^  r 

Prom  (2-26),  (2-27)  and  (2*28)  we  obtain,  arguing  as  before, 
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2-3.  Proof  of  theorem  lforr>l.  In  this  case,  taking  the  greatest  integer 
less  than  r  to  be  p,  we  have  p  ^  1  and  r  —  p>  0. 

First  we  can  show  that  A(a))  =  OR[&((t))]  starting  from  the  following 
relation  equivalent  to  lemma  6  : 


where  we  take  Am^o><  Am+1  and  (p+l)<?  =  e#(Am).  If  we  multiply  both 
sides  of  (2-31)  by  (Am  —  x)r~~p~l  and  integrate  with  respect  to  x  from  A^,  —  9 
to  Am,  we  have 


{A(a))-A(t)}dt 


=  ^  +  4  +  7     (say).  (2-32) 

In  virtue  of  (2-  15), 


Treating  each  term  in  this  expression  as  we  did  J^  in  §2-2,  we  find  that 
lemma  1  helps  to  establish 


2*  S        #(A.)  <         r     2P«£r^(A.).         (2-33) 


By  (2-11),  J^X^e  +  v)^^),  (2-34) 

and  by  (2-17),  /^z-frte  +  ^L^AJ.  (2-35) 
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From  (2-32),  (2-33),  (2-34)  and  (2-35),  we  get 


whence,  remembering  that  9  ==  e^(Am)/(^>  4-1),  we  are  led  first  to 


and  then,  as  a  result  of  #(AJ  ^  (Hlh)ri<r+»&(a)),  to 

(m  r/(r+l)  f  TVr  _  AJ  _i_  1  \  #        -0+1  \ 

f)          ^(p+lYl~(^^~  +  ^K(e  +  r,)^).      (2-36) 


To  prove  that  A(OJ)  —  OL[^(a))]  we  start  from  the  following  immediate 
consequence  of  lemma  7  : 

\V  Ap(r\  f#+9 

<f>2>-4(&))  =  rS^iy  ~  ^  J  .    {4(<)  ~  A(o})}  dt 


where  we  choose  Am^<y<Am+1  and  (^+1)9  =  e^(Am).  Multiplying  both 
sides  of  (2-37)  by  (Am  +  9  —  x)r-p~l  and  integrating  with  respect  to  x  from 
Am  to  Am  +  9  we  then  obtain 


+*  (Am  +  9  -  */-»-»  <fa  f  "+  V(<) 

Jx 

fA^+9  fa+9 

-f        (A.  +  .p-*)—  -icfaf      (»+9-0 

JAW  Jx 


=  J3_J4_jT'     (say).  (2-38) 

From  this,  reasoning  as  in  the  first  half  of  our  proof,  but  using  (2-16)  in 
place  of  (2-15),  (2-13)  in  place  of  (2-11)  and  finally  (2-18)  in  place  of  (2-17), 
it  is  not  difficult  to  conclude  that 

).       (2-39) 
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It  may  be  observed  that,  if  p  =  1,  /is  absent  from  (2-32)  and  /'  from 
(2-38);  nevertheless,  (2-36)  and  (2-39)  continue  to  hold  withp  =  1.* 

2*4.  Modifications  of  theorem  1.  (i)  Dropping  the  condition  involving  h  in 
(2-3),  we  can  still  prove  that  A(\n)  =  O[#(AJ]. 

(ii)  In  the  case  in  which  r  is  a  positive  integer,  we  can  conclude  that 
A(\n)  =  O[#(An)]  dropping  the  condition  involving  h  as  well  as  the  condition 
that  <j>(x)  should  be  non-decreasing.  For,  the  condition  mentioned  last  is 
required  only  when  changing  from  Aw  to  6>  in  A(\n)  —  0[$(An)]  and  when 
applying  lemma  1 .  The  application  of  this  lemma  becomes  superfluous  if 
r  is  a  positive  integer,  since  we  can  work  then  with  (2-31)  instead  of 
(2-32),  and  with  (2-37)  instead  of  (2-38),  replacing  p  by  r  and  x  by  Am. 
Mordell's  theorem  (9)  thus  appears  as  the  particular  case  An  =  nt  r  =  1  of 
our  modification  (ii),  apart  of  course  from  his  multiplying  factor  C  in 
\A(n)\<C&(n). 

3,  Remarks  on  theorem  1.  (i)  We  can  replace  the  hypothesis  6(x)  ^x  by 
6(x)  =  O(x)  as  #->oo,  introducing  certain  obvious  modifications  elsewhere. 

(ii)  If  0(x)  ^x,  we  can  replace  the  relation  between  x'  and  x  in  (2-2)  and 
(2-3),  viz.  \x'-x  \^e(x)9  by  |  (x'-x)/x  \  ^. 

*  We  sketch  below  the  referee's  alternative  proof  mentioned  in  an  earlier  footnote. 
In  the  proof  of  A(OJ)  =  OR[&(a))]  we  could  use,  instead  of  lemma  6,  the  identity 

rx          /•<!          rtt          rtp-i 

)\      dtA      dtA      ... 

J  x—<f>       J  <!  —  <p       J  «»— 9      J  tp—i— <p 

proved  easily  by  induction,  employing  for  example  lemma  4.  (2-31 )  would  then  become 

Ap  Av(x\       Cx  rtl  r^p-i 

V*A(a>)  =  -^— VT'  +  dtA         ...  {A(a>)-A(t)}dt, 

AP+l)          J  x-9        J<i-9        J  tp-i-V 

and  (2-32)  would  become 


/•AM  rx  fh  ft?-! 

(Xm-xy-»-idx          dtA         ...  {A(u)-A(t)}dt 

J  Am—  f  J  X~V        ^  <l  —  9        J  tp-l—ff 


=  J\  +  *7,     where     J  =  J2  +  /. 
Hence,  in  place  of  (2-34)  and  (2-35),  there  would  be  the  relation 


r—p 

which  enables  us  to  prove  (2-36),  not  only  dispensing  with  lemma  5  but  also  with  a 
better  constant  factor  multiplying  $(<w)  in  (2-36). 

Our  proof  of  A  (a))  =  OL[#(w)]  can  of  course  be  modified  along  similar  lines. 
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(iii)  That  some  such  condition  as  0(x)  =  O(x)  is  essential  to  the  truth  of 
theorem  1  is  brought  out  by  the  following  example.    If  r  =  1,  An  =  n> 
an—  1/Ti  =  i/r(n),  then  A(n)  =  a^  +  az  +  . . .  +  an ~ log n,  whence 
Al(n)  =  A(l)+A(2)+...+A(n)~nlogn  =  (j>(n) 
and  consequently  A(n)/-d-(n)~^(logn)->ao.  In  this  example 

6(x)jx  =  *J(logx)->co 
which  accounts  for  the  fact  that  A(n)  =  O[&(n)~]  does  not  hold. 

4.  Deductions  from  theorem  I.  Suppose,  in  (2-6),  d  can  be  made  as  small 
as  we  like  for  all  sufficiently  large  a>.  Then,  in  the  case  of  integral  r,  arguing 
as  in  §2-4(ii)  and  without  appealing  to  lemma  1,  we  can  put  d  —  e?+1  in 
(2-36)  and  (2-39)  whenever  w^w0(e).  In  the  case  of  non-integral  r,  we  can 
do  the  same  in  (2-25),  (2-29),  (2-36)  and  (2-39)  appealing  to  the  following 
lemma  of  Riesz*  where  we  formerly  appealed  to  lemma  1. 

LEMMA  1A.  //0<;£^a>  and  p  denotes  the  greatest  integer  less  than  non- 
integral  r>0,  then  AT^)  =  o[0(w)],     o>-»oo, 
where  $((*))  is  a  positive  non-decreasing  function,  implies 
rz 
Jo 
uniformly  with  respect  to  £. 

Thus,  when  d  in  (2-6)  and  ?/  in  (2-4)  can  both  be  made  arbitrarily  small 
for  all  large  a>  and  nt  theorem  1  yields 

COROLLARY  1-1.    //,  in  theorem  1,  (2-4)  and  (2-6)  are  replaced  by 


respectively,  (2-7)  will  be  replaced  by  A(a>)  =  o[#(o>)]. 

The  conclusion  in  the  above  corollary  can  be  substantially  ensured  even 
without  (2-4)  provided  that,  in  the  argument  leading  to  (2-  1  1  b), 


as  q->co.  This  is  done  in 

COROLLARY  1-2.    //,  in  theorem  1,  (2-6)  is  replaced  by  Ar(a))  =  o[0(o>)], 
(2-5)  is  supplemented  by  liman/^(A/i)^0,  (2-4)  and  the  condition  involving 

n-*-oo 

h  in  (2-3)  are  left  out,  (2-7)  will  become  A(An)  =  o[#(AJ].t 

*  Riesz  (10,  121)  states  his  lemma  in  the  form:  Let  functions  f(x),  <f>(x)  be  defined 
for  positive  x  so  that  the  first  is  of  bounded  variation  in  any  finite  interval  and  the  second 

rx 
is  positive  non-decreasing;  letQ<a<l.  Then  I    (x  —  t)*-lf(t)  dt  =  <?[$(#)],  x  ->  oo,  implies 

/£,  J  o 

(x  —  ty-ifWdt  =  o[6(x}\  uniformly  with  respect  to  £  in  (0,o;). 
0 
|  Cf.  Szasz's  convergence  condition  for  series  summable-JR(An,  1)  (6,  36). 
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As  a  particular  case  of  corollary  1-1  we  have 

COROLLARY  1-3.  Let  an/(An-Aw_1)^JK:A5,  Ar(<*))  =  0(0^),  where  r>0, 
0<£^a  +  r+l.  ThenA(aj)  =  o(Gt*r+»+*>)  provided  Xn-Xn_±  =  o(A<f-a>^+1>). 

This  corollary  can  of  course  be  suitably  modified  by  substituting  "0" 
conditions  for  the  "o"  conditions.  It  is  a  reformulation  of  Ananda  Rau's 
theorem  referred  to  in  §  1,  with  a  restriction  on  {An}  and  the  replacement 
of  the  "0"  condition  on  an  by  the  corresponding  "  OR"  condition.* 

Corollary  1-1  with  $(x)  —  {xg(x)}r,  r  >  0  and  i/r(x)  =  l/xg(x)  gives 

COROLLARY  1-4.  Let  g(x)  be  a  positive  function  of  x>0  such  that  xg(x)  is 
non-decreasing  and  O(x)  as  X-+CQ.  Also,  let  Q<h^g(x')/g(x)^H  for 
|  (x'-x)/x  |  ^<  1.  Then  aJ^-A^K^/A^AJ  and  <rr(a))  =  o[{g(a))}r], 
r  >  0,  together  imply  A(a))  =  o(l),  provided  that  (^n  —  ^n^i)/^n  ~  °[ff(^n)^- 

The  case  r  =  1  of  the  corollary  bears  to  the  Hardy-Landau  convergence 
theorem  mentioned  in  §  1  a  relation  which  may  be  expressed  thus.  In  the 
result:  an/(Aw-Ayt_1)^^(An)  and  o-HAJ  =  o(l)  together  imply  sn  =  o(l), 
when  (An  —  An_!)/An  =  o(l),  we  know  that  fr(x)  =  l/x  and  cannot  be  replaced 
by  x(x)lx  if  #(#)->  oo.  The  corollary  however  shows  that  when  the  rapidity 
of  convergence  of  o"1(An)  to  0  is  taken  into  account  by  postulating 


we  can  have  i/r(x)  =  l/xg(x),  provided  the  rapidity  of  convergence  of 
(Aw-An-1)/An  to  0  is  not  less,  i.e.  (An-An_1)/Aw  =  o[0(AJ].  We  may  take 
g(x)  to  be  a  suitable  L-function,  a?a(log#)ai(log2a?)a2  ...  (logpa;)ap  —  in 
particular,  any  one  of  the  functions  I/^/JB,  I/logo;,  logx/x,  etc. 

5.    Further  deductions.  It  is  easy  to  prove  that  theorem  1  can  be  given 
a  form  such  as  the  following. 

THEOREM  2.   Hypotheses: 
(i)  <j>(x),  &(x)  are  positive  functions  ofx>0  such  that 

6(x)  =  {<f>(x)l&(x)}Vr  <x     (r>  0).  (5-1) 


(ii)  (f>(x)  is  non-decreasing',  for  0<x'  —  x^nj6(x)1  i}<l,  there  is  an 
such  that 

H.  (5-2) 


*  If  we  omit  the  trivial  part  of  Ananda  Rau's  theorem  corresponding  to/?>a  +  r+l, 
as  well  as  Bosanquet's  relaxation  of  the  condition  /?>Oto/?>—  1,  we  can  state  the 
theorem  thus.  ie^an/(An-An_1)  =  O(X*}andAr(t,))  =o(^),wfcerer>0,0</?<<x  +  r-t-  1. 
Then  A(u)  =  o(w<«'+/M''+1>). 

Clearly  Bosanquet's  arguments  for  the  replacement  of/?>0  by  /?>—l  in  this 
theorem  (2,  243  f.)  justify  the  replacement  in  corollary  1-3  as  well. 
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(iii)  For  o)  —  fjO 


.  (6-3) 

(iv)  \A'(a>)\^3$(a>).  (5-4) 

Conclusion:  A(a>)  =  0[#(cw)].  (5-5) 


In  particular,  if  W(y)-+Q  as  ?/->0  and  (5-4)  is  altered  to  Ar(o))  =  o[$((o)], 
(5-5)  will  become  A(OJ)  —  o[#(<y)]. 

Proof.  We  have  only  to  employ  (5*3)  in  place  of  lemmas  2,  3,  with  the 
result  that  Am  in  lemma  5  and  §§2-2,2-3  is  changed  to  w,  9  being  defined  by 
=  e6(a)). 


5-1.   From  theorem  2  we  can  deduce 

COROLLARY  2-1.  //  <f>(x),  \jr(x)  are  both  positive,  $(x)  is  non-decreasing, 
<f>(x),  i/f(x)  are  both  subject  to  a  condition  like  (5-2)  for  \  x'  —  x  \  ^y6(x),  and 
related  so  that  0(x)  =  {fi(x)/fi(x)}liw>  ^x,r>0,  and  if  AK(a>) 
and  AK+r+\a>)  =  O[^(w)],  then  AK+\w)  =  O[&(a})],  where 


Proof.   For  a)  ~  v 


[°A*(t)dt 

J  u 


.    (5-11) 

Now  setting  J3(&»)  =  AK+1(a>),  we  find  in  consequence  of  a  result  of  Hardy 
and  Riesz  (4,  lemma  6)  that 


whence  J5r(w)  =  0[0(w)].  (5-12) 

Comparing  (5'1  1)  with  (5-3)  and  (5-12)  with  (5-4)  we  see  that  the  proof  can 
be  completed  by  an  appeal  to  theorem  2  after  changing  A(a>)  there  to  B(a>). 
If  in  corollary  2-1  we  alter  the  hypothesis  in  respect  of  AK+r+1(a))  to 
AK+r+l(a))  =  o[^(w)],  the  conclusion  will  become  AK+1((o)  —  o[#(o>)].  If 
further  we  take  ^(o>)  =  a)K,  <j>(d>)  =  a)K+r+1,  the  result  is 


COROLLARY  2-2.  //,  for  K^  0,  r  >  0,  a  series  San  has  its  Riesz  mean  < 
bounded  on  one  side  as  </>->oo  and  if  the  series  is  summable-R(Xn)K  +  r+  1) 
to  zero,  then  the  series  is  8ummable-R(\n,K+  1)  to  zero. 

This  corollary  is  substantially  equivalent  to  a  theorem  of  Doetsch 
(3,  178,  Satz  2). 
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6.  This  paper  would  be  incomplete  if  we  did  not  mention  the  effect,  on 
the  conclusion  of  theorem  1,  of  varying  the  Tauberian  condition  (2-5)  in 
the  manner  of  Karamata  (6  a).  Three  such  variations  are  included  in  the 
following  theorem  which  supplements  corollary  1-2  and  its  O-  version. 

THEOREM  1A.  //  we,  postulate  hypotheses  (i),  (ii),  (v)  as  in  theorem  1, 
without  the  "h"  condition  in  (ii),  then  EITHER 

(iv)  aJ(ATC  -  An_j)  =£  K^  (AJ       OR      (iv')  ^/(A,,  -  An_,) 
ensures  the  conclusion  EITHER 

]      OR 

Further,  the  alteration  of  (v)  to  Ar(a>)  =  o[^(w)]  changes  the  "  O"  in  the 
conclusion  to  "o". 

The  first  alternative  of  the  theorem  follows  from  the  second  half  of  the 
proof  in  §  2-3.  A  proof,  like  the  first  half  of  the  one  in  §  2-3,  can  be  given  for 
the  second  alternative,  using  the  lemma  which  follows  in  place  of  lemma  2. 

LEMMA  2  A.  Under  the  hypotheses  (2-1),  (2-3)  without  the  "h"  condition, 
and(iv'),  A(Xm^)-A(t)<KHe$(Xm)  (o<e^), 

provided  that  Am  —  e#(Am)  ^  t  <  Am. 

Taking  $(x)  —  {xg(x)}r  and  i/r(x)  =  l/xg(x)  in  corollary  1-2  and  theorem  1  A 
we  get  the  following  supplement  to  corollary  1-4. 

COROLLARY  1  A.  Corollary  1-4  remains  true  when  its  two  assumptions: 

an  AnS-(AJ  <  K(Xn  -  Vi),     An  -  Are_t  =  o[ABfl-(AJ], 
are  replaced  by  one  of  the  following  (without  any  other  change)  : 


I  «n  I  *n9(*n)  ^    EITHER  K(\n  -  A^)  OR  K(\n^  -  An)  ; 

An_t)  AND  K(XM~  AJ. 
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UNE  INTERPRETATION  ALG^BRIQUE  DE  LA  SUITE 

DES  ORDRES  DE  MULTIPLICITY  D'UNE 

BRANCHE  ALGfiBRIQUE 

By  CAHIT  ABF 

[Received  7  September  1945.  —  Read  15  November  1945.  — 
Received  in  revised  form  30  September  1946] 

La  suite  des  multiplicites  des  points  successifs  d'une  branche  algebrique 
pent  se  definir  an  moyen  de  notions  purement  algebriques.  Dans  ce  qui  suit 
nous  exposons  une  telle  definition  qui  lie  differe,  du  reste,  de  la  definition 
geom6trique  que  par  la  forme.  Nous  esperons  que  cette  definition  constitue 
une  response  a  la  question  posee  par  P.  Du  Val*  sur  la  relation  qui  doit 
exister  entre  ses  re*sultats  et  les  developpements  en  series  entieres  de  la 
branche  considered. 

1  .  "k  6tant  un  corps  quelconque,  nous  considerons  un  anneau  H  constitue* 
par  des  series  entieres  d'une  variable  t  avec  coefficients  dans  k.  Soient 


les  ordres  (c.a.d.  les  ordres  des  premiers  termes  avec  coefficients  non  nuls) 
des  elements  de  H.  Les  entiers  v*i>  •••>*,.>•••  constituent  un  semi-groupe 
d'entiers  positifs.  SQ,  Silt  Siz,  .  .  .  ,  Sir,  .  .  .  etant  des  e!6ments  d'ordres  respectifs 
i0,  *!,...,  tr,  .  .  .  dans  H,  tout  element  de  cet  anneau  est  de  la  forme 


Nous  admetterons  que  H  contient  toutes  les  series  de  cette  forme.  Nous 
de*signerons  par  Ih  1'ensemble  des  elements  de  H  d'ordres  superieurs  ou 
egaux  a  h.  Ih  est  visiblement  un  ideal  de  H  et  ses  elements  sont  de  la  forme 

(0,6  ft). 


*  P.  Du  Val,  "The  Jacobian  algorithm  and  the  multiplicity  sequence  of  an  alge- 
braic branch",  Rev.  Factdte  Sci.  Univ.  Istanbul  (S6rie  A),  7  (1942),  107-112. 
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THEOREME  ATJXILIAIRE  1.   v  etant  le  p.g.c.d.  des  elements  de  W(H),  pour 
r  suffisamment  grand  on  a 


et  il  existe  une  serie  entiere  d'ordre  1, 


oo 

telle  que  tout  element  de  H  soit  de  la  forme  X  <*>jTjv. 

Demonstration.  Designons  par  vl  le  p.g.c.d.  des  entiers  il9  iz, . . .,  it.  Chacun 
des  nombres  vl9  v2, . . . ,  vhi . . ,  divise  alors  tous  ceux  qui  le  precedent.  II  en 
resulte  que  pour  p  suffisamment  grand  on  a  v  =  vp+i  —  vp+2  —  . . .  —  v. 
Soit  alors  _ 

ml9mZ9  ...,mp  etant  des  entiers  positifs,  nuls  ou  negatifs.  m  etant  le  plus 
grand  des  entiers  |  mu(i^\v  —  1 )  | ,  les  multiples  de  v  qui  sont  superieurs  a 

1  2  p 

sont  contenus  dans  W(H).  On  a  en  effet,  pour  I  —  0,  1, 2, ..., i^v—  1, 

&!  +  (m  +  Im2)  i%  +  . . .  -h  (m  +  lmp)  ip 


avec  TZ^  >  0;  puisque  m  ^  |  m^Z  |  .  Pour  I  —  i^/v,  on  aura  i  +  lv  —  i  +  i±€  W(H). 
D'une  maniere  generate,  les  multiples  de  v  qui  sont  superieurs  a  i  peuvent 
s'ecrire  sous  la  forme  i  +jii  +  lv  (I  —  0,  1,  2,  .  .  .,  i^jv  —  1,  j  ^  0)  et  il  est  evident 

P 
que  tous  ces  entiers  sont  de  la  forme  ^nhih  avec  nh^0;  c.a.d.  contenus 


dans  W(H).  &  =  2°^  (°~i*-k>o'i  =f=0)  etant  un  element  d'ordre  tt  de  H, 

Z=i, 

on  peut  choisir  une  serie  entiere  de  la  forme  r  =  t[  1  +  X  £^  I  ,  (^€  k)  de  maniere 

\      z=i      / 

que  Ton  ait  ^^  =  cr^r^.  Dans  ces  conditions  les  series  entieres  en  t  avec 
coefficients  dans  k  peuvent  s'ecrire  sous  formes  de  series  entieres  en  r  avec 
coefficients  dans  k.  En  particulier  les  elements  de  H  peuvent  s'ecrire  sous  la 

oo 

forme  Sa^T?".  Nous  pouvons  nous  contenter  de  demontrer  ceci  pour  les 

j  =  0 

elements  de  H  qui  sont  d'ordres  superieurs  a  »';  puisque  tout  element  de  H 
peut  etre  considere  comme  quotient  d'un  e!6ment  d'ordre  superieur  a  i  de 
H  par  une  puissance  convenablement  choisie  de  Sit  =  o-^r1^^.  Les  ordres 
des  elements  de  H  etant  des  multiples  de  v,  un  element  quelconque  de  H  est 

de  la  forme  £  a^r?  (a,€&,  0^4=0).   Pour  Nv^i,  Tanneau  H  contient  des 

j-Nv  ' 
SEB.  2.     VOL.  50.     NO.  2395.  R 
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Amenta,  8Nv^9  SNv+Zv,  ...\SNv+lv  =     S     aur',  <%€&,  a^+^OJ  d'ordres 

V  j=Nv+lv  I 


respectifs  A/V  +  v,  iW+2j;,  ....  On  peut  done  choisir  la  s6rie  ^i^iSNv+lv  de 

z=i 
maniere  que  la  difference 


ne  contienne  aucun  terme  d'ordre  divisible  par  v,  autre  que  le  premier. 
Supposons  en  effect  qu'on  ait  pu  choisir  filt  /?2,  ...,  j3h  de  maniere  que  les 
terms  d'ordres  Nv  +  v,  Nv  +  2v,  ...,Nv  +  hv  de  la  difference 


=*v  = 

disparaissent;  il  suffit  alors  de  poser 

>y(/j) 

ft          —      ^Nv+hv+v 
Ph+l  —  ~     '     * 

ah+l,Nv+ht>+i> 

pour  que  les  termes  d'ordres  Nv  +  v,  Nv-\-Zv,  ...,  Nv  +  hv}  Nv  +  hv  +  v  de  la 
difference  w  h+l 

S  a^7'-  S  A^,4Jr  =  a^r^  +  a*+1>r/*Hi+... 

j,.  A^  1  =  1 

disparaissent.    Dans  ces  conditions  la  serie  SNv  se  reduit  a  aNvrNv.    Car 
sinon,  la  difference 


dont  Tordre  n'est  pas  divisible  par  v  serait  contenue  dans  H.  Done  tout 
element  d'ordre  superieur  ^i  i  de  H  est  une  combinaison  lineaire  a  coefficients 
dans  k  des  elements  de  la  forme  aNvrNlf  =  8N)t. 

Remarque.  D'apres  le  th^oreme  qui  precede,  1'anneau  H  peut  etre  con- 
sidere  comme  un  sous  anneau  de  1'anneau  des  series  entieres  de  la  variable 
T  =  rv  avec  coefficients  dans  k.  Posons  *ih  =  ih/v.  Les  ordres  des  elements 
de  H  par  rapport  a  cette  nouvelle  variable  seront  *i0  =  0,  ^"1?  *ia,  ...,  \,  ..., 
et  pour  r  suffisamment  grand,  on  aura 


TH^OEBME  AUXILIAIBE  2.  U  inverse  de  tout  element  d'ordre  zero  de  H  est 
aussi  un  element  de  H. 

00 

Demonstration.  Si  1'ordre  de  a  =  S  #/,&.  est  zero,  a0  est  different  de  zero. 

fc  =  0  * 

Or  les  coefficients  flh  du  produit 
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peuvent  etre  choisis  de  maniere  que  Ton  ait 

)  =  1  ,     mod  «VU. 


Supposons  en  effet  que  ce  choix  ait  pu  etre  fait  pour  filt  /?2,  ...,  /?n_t.  On  aura 


et  il  suffira  de  poser  ftn  =  —  yw  pour  avoir 

aao1  ft  (  1  +  A  <Sy  -  1  ,     mod  *%-H. 

A  =  l  * 

Pour  les  coefficients  flh  ainsi  choisis  on  aura  visiblement 


Remarque.  2  #/,$,-   etant  un  element  d'ordre  zero  de  H,  a  chaque  racine 

A-O        * 

00 

meme  de  a0  contenue  dans  k  correspond  une  racine  meme  de  X  ochSih  con- 

^=o 

tenue  dans  f?.  La  demonstration  de  ce  fait  est  analogue  a  celle  du  theoreme 
auxiliaire  2. 

2.  THEOREME  AUXILIAIRE  3.  Si  Von  designe  par  IJSh  V  ensemble  des 
quotients  des  elements  de  Ih  par  Sh,  et  par  [Ih/Sh]  Vanneau  engendre  par  Ih/Sht 
Vanneau  [Ih/S  h]  ne  depend  pas  du  choix  de  Shparmis  les  elements  d'ordre 
hdeH. 

Demonstration.  Remarquons  d'abord  que  1'ensemble  Ihf/Sh  contient 
1'anneau  H  et  par  consequent  [Ih/Sh]l2H. 

Soit  Sh  =  eSh  un  autre  element  d'ordre  h  de  H  .  e  est  alors  un  element  de 
[Iti/Sh~].  II  en  resulte  d'apres  le  theoreme  auxiliaire  2,  que  e^1  est  aussi  un 
element  de  [Ih/Sh],  On  a  done 

ik/S'h  =  i,jesh  =  ^(A/fl^)  s  [A/sj 

et  par  consequent  [Ihl&h\  —  [-4/^/J- 

On  peut  evidemment  montrer  exactement  de  la  meme  maniere  que 
Ton  a  aussi  rT  ,q  -, 


On  a  done  [Ih/S'h]  =  [Ih/8h]. 

L'anneau  [I^/S^  ne  dependant  pas  du  choix  Sh,  nous  pouvons  le  designer 

par  [Al- 

Remarque.  Le  semi-groupe  W([Iih\)  contient  visiblement  le  semi-groupe 
engendr^  par  les  entiers 


R2 
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qui  sont  les  ordres  des  elements  de  Iih/8ih.  Mais  W([Iih\)  n'est  pas  necessaire- 
ment  identique  a  ce  semi-groupe  comme  le  montre  1'exemple  suivant: 
Consid6rons  1'anneau  H  forme  de  toutes  les  series  de  la  forme 


avec  X  =  £4,    y  =  tlQ  +  tu.    On  montre  facilement  que   W(H)   est   forme 
des  entiers          Q  ^  ^  J()   ^  ^  ^  J8  2Q  22   ^  ^  ^ 

28,  29,  30,  32,  33,  34,  35,  36,  37,  38,  .... 
Les  ordres  des  elements  de  IJX  sont  alors  les  entiers 

0,  4,  6,  8,  10,  12,  14,  165  18,  20,  21,  22, 

24,  25,  26,  28,  29,  30,  31,  32,  33,  34,  ... 

qui  engendrent  le  semi-groupe 

0,  4,  6,  8,  10,  12,  14,  16,  18,  20,  21,  22,  24, 

25,  26,  27,  28,  29,  30,  31,  32,  33,  34,  ... 

tandis  que  [J4]  contient  1'element  (  Y/X  )2  -  X3  =  2J17  4-  Z22  dont  1'ordre  est  17. 
Remarque.  Si  pour  un  choix  particulier  de  Sh,  1'anneau  [Ih]  est  identique 
a  IhjSh>  il  en  est  de  meme  pour  tous  les  choix  de  Sh.  En  effet,  S'h  —  eSh  etant 
un  autre  element  d'ordre  h  de  H,  on  aura 

A  AS;  =  e-*i,jsh  =  e-H4]  =  t/J; 


puisque  tout  616ment  A^  de  [/A]  est  6gal  a  l'61ement  eS  de  [Ih]  multiplie  par  e~l. 

Definition.  Nous  disons  qu'un  anneau  H  est  canonique  si  Von  a  [Ih]  =  IhjSh 
quelque  soit  he  W(H). 

Remarque.  Si  H  est  un  anneau  canonique,  les  entiers 


forment  un  semi-groupe  quelque  soit  A.  (7%  semi-groupe  d'  'entiers  non  negatifs 

*0  =  0,  t'i,  tB,  ...,  tA,  ... 
€5t  appele  canonique  si  la  suite 

*\-*A  =  °»  *A+1~H'  *A+2~U>   -•• 

es£  ttn.  semi-groupe  pour  chaque  h.  Si  la  suite  des  entiers  croissants 

*o  =  °>  *i5  ^2»  •••>  **»  ••• 
est  un  semi-groupe  canonique,  les  series  entieres 
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forment  visiblement  un  anneau  canonique.   W(H)  pent  etre  canonique  sans 
qu'il  en  soit  de  meme  pour  H:  L'anneau  H  form6  des  series  de  la  forme 

=  £4,  Y  =  tw  +  t™,Z  =  £27,  est  tel  que  les  ordres 


0,  4,  8,  10,  12,  14,  16,  18,  20,  22,  24,  25,  26,  27,  28,  29,  30,  ... 

de  ses  elements  forment,  comme  on  le  verifie  facilement,  un  semi-groupe 
canonique,  alors  que  //  n'est  pas  un  anneau  canonique,  puisque  [/4]  contient 
1'element  d'ordre  17,  (T/X)2-^3  =  2J17  +  *22,  qui  n'est  pas  contenu  dans 
IJX. 

THEOREME  AUXILIAIRE  4.  La  partie  commune  de  plusieurs  anneaux 
canoniques  est  un  anneau  canonique. 

Demonstration.  II  suffit  evidemment  de  demontrer  le  theoreme  pour  la 
partie  commune  de  deux  anneaux  canoniques  seulement.  H  et  Hr  etant 
deux  anneaux  canonique,  soit  S  un  element  commun  de  ces  deux  anneaux. 
Soit  h  Tordre  de  $  et  soient  Ih  et  I'h  les  ensembles  d'  elements  de  H  et  de  //' 
dont  les  ordres  ne  sont  pas  inferieurs  a  h.  II  suffit  de  montrer  que 


est  un  anneau.  Or  Ih/S  et  I'hj8  etant  des  anneaux,  il  en  est  de  meme  de 
leur  partie  commune. 

Remarque.  Si  H  est  un  anneau  canonique,  il  en  est  de  meme  de  [/ij. 
Considerons  en  effet  Fensemble  des  elements  de  Iih.  Ces  elements  sont  de 
la  forme 


H  etant  un  anneau  canonique  1'anneau  [/^]  est  constitue  par  1' ensemble  des 

00  $. 

series  de  la  forme  2  aj/  ^  dont  les  ordres  sont  les  nombres 

L'ensemble  des  elements  de  [Iih\  d'ordres  superieurs  ou  egaux  a  jt  est  alors 
1'ensemble  des  series  de  la  forme 

oo  .Cf. 

2      0^-~       (<*„€&). 

SifuJSifi  etant  un  element  d'ordre  jl  =  ih+i  —  ^h  de  cet  ensemble,  Tensemble 
des  elements 


constituent  1'anneau 
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$  £tant  Fensemble  de  tous  les  entiers  non  negatifs,*  on  montre  d'une 

maniere  analogue  que  si      fn    .     . 
6 


est  une  semi-groupe  canonique,  il  en  est  de  meme  de 


Remarque.    Si  les  entiers 

*0  =  0,  »±,  *a,  ...,  »A,  ... 

forment  une  semi-groupe  canonique  on  a  *Vfi~*A^*A~**-i-  En  effet,  les 
entiers  ^_i~  H_i  =  0,  ih  —  ih-^  */H-I~*A-I»  •••»  *r~%-i»  •••  devant  former  un 
semi-groupe,  on  doit  avoir  ^+1  —  ^-1  <2(ife—^_1);  d'ou  resulte  immediate- 
ment  I'inegalit6  *A+1  —  i/t  <  */t  —  ^_i- 

3.  D'apres  la  remarque  qui  suit  immediatement  le  theoreme  auxiliaire  1, 
I(N-I)V  contient  toutes  les  series  entieres  dont  les  ordres  en  T  —  rv  sont 
sup&rieurs  ou  egaux  a  N  —  1  pourvu  que  N  soit  suffisamment  grand.  [/<#_!)„] 
est  alors  1'anneau  k\T]  de  toutes  les  series  entieres  en  T  avec  coefficients 
dans  k.  Cette  remarque  nous  conduit  a  la  construction  suivante  qui  nous 
permet  d'obtenir  tous  les  anneaux  canoniques  ainsi  que  tous  les  semi- 
groupes  canoniques: 

On  commence  par  considerer  1'anneau  [/(#_!)„]  =  k[T]  de  toutes  les  series 
entieres  en  T  de  merne  que  le  semi-groupe  Qbv  des  multiples  entiers  non 
negatifs  de  p.  On  choisit  un  element  d'ordre  non  nul,  Tr_l  de  [/ov_i>J  de 
meme  qu'un  element  non  nul  vr_±  (  —  t^(77r_1)t)  de  &v  et  Ton  pose 


L'anneau  [/ir_J  et  le  semi-groupe  {0,  ^r_!  +  ®^}  (=  ^(K^J))  sont 
canoniques.  On  choisit  de  meme  un  element  Tr_z  d'ordre  non  nul  dans 
l/ir-J  ainsi  qu'un  entier  positif  vr__z  (=  w(Tr_2))  dans  {0,  ^i  +  ^Jv},  et  Ton 

pose 


On  obtient  ainsi  un  nouvel  anneau  canonique  ainsi  qu'un  semi-groupe 
canonique.  En  continuant  de  cette  maniere  on  obtient  finalement  1'anneau 
canonique 


*  Dans  ce  qui  suit  ©  dcSsignera  toujours  1'ensemble  de  tous  les  entiers  non  negatifs. 

(00  \  00 

2  oiit*  I  d^signe  1'ordre  en  t  de  la  serie  entiere  S  a^<. 
i-u       /  i=u 
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et  le  semi-groupe  canonique 

l  Th+2  +  .... 


4.  Etant  donne  un  anneau  H  la  partie  commune  de  tons  les  anneaux 
canoniques  contenant  H  est  un  anneau  canonique  *H  que  nous  appelerons 
la  fermeture  canonique  de  H.  De  meme  O  —  {0,  ilt  z2,  .  .  .,  v_i  +  Gbv}  etant  un 
semi-groupe  d'entiers  non  negatifs  (v  =  (tl5  i2,  ...^ir_^  ir__i  +  v)),  la  partie 
commune  de  tous  les  semi-groupes  canoniques  contenant  O  est  un  semi- 
groupe  canonique  *6r;  nous  Fappelerons  la  fermeture  canonique  de  O. 

D'apres  cette  definition  il  est  clair  que  W(*H)  contient  ie  semi-groupe 
canonique  *W(H  );  mais  ces  deux  groupes  ne  sont  pas  riecessairement  egaux, 
puisque  W(H)  peut  etre  canonique  sans  que  H  le  soit. 

5.  Etant  donne  le  semi-groupe 


la  fermeture  canonique  *6?  de  O  peut  s'obtenir  de  la  maniere  suivante: 
On  considere  le  semi-groupe  {0,  %  +  (2-J  ou  Gl  est  le  semi-groupe  des  entiers 

de  la  forme  ,  .       .  x         .  .       .  x  ,  .       .  . 

aa(*2-*i)  +  as(*3-*i)+---+a»(*n-h)> 

les  coefficients  a1?  a23  •  •  •  >  aw  etant  des  entiers  non  negatifs.  Le  semi-groupe 
{0,  i±  +  GJ  qui  contient  alors  G  est  visiblement  contenu  dans  *G.  Remarquons 
que  les  e!6ments  de  G^  qui  sont  inferieur  a  ih+i  —  i^  sont  de  la  forme 


les  nombres 


avec  n^h+1,  an4=0  sont  en  effet  superieur  ou  6gaux  a  ih+l  —  iv  En  par- 
ticulier  1'element  le  plus  petit  de  G^  est  iz  —  i±.  II  en  resulte  en  outre  que  les 
elements  de  {0,  it  -f-  (7J  qui  sont  inf«6rieurs  a  ih+1  ne  dependent  que  des  nom- 
bres il9  i2,  ...,  ih,  dont  ils  sont  des  combinaisons  lin&dres  a  coefficients 
entiers.  Le  semi-groupe  {0,1'j-f  G^}  etant  contenu  dans  *6?,  il  en  serait  de 
meme  de  {0,  i±  +  *G^  qui  contient  {0,  tx  -f-  G-^  2  G,  et  qui  est  canonique.  On 
a  done  *Q  =  {0,  i±  +  *^i}.  La  construction  de  *G  est  ainsi  ramenee  a  la 
construction  de  la  fermeture  canonique  d'un  semi-groupe  de  la  forme 


pour  lequel  on  a  i^'-i  ^  V-i  ~  H-  ^a  repetition  de  cette  construction  nous 
pennet  done  de  ramener  la  construction  proposee  a  celle  de  la  fermeture 
canonique  d'un  semi-groupe  GN  qui  se  reduit,  pour  N  suffisamment  grand, 
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au  semi-groupe  ®v  de  tous  les  multiples  non  negatifs  de  v.  %v  etant  sa 
propre  fermeture  canonique,  la  construction  proposee  se  trouve  ainsi 
eifectuee.  Remarquons  que  les  elements  de  *G  ainsi  construits  ne  dependent 
que  des  e!6ments  de  O  qui  ne  les  depassent  pas;  et  qu'ils  en  sont  des  com- 
binaisons  lineaires  a  coefficients  entiers.  Supposons  en  effet  que  ceci  ait  ete 
demontre^  pour  la  fermeture  *G^  de  G±.  Les  elements  de  ^G^  qui  sont  in- 
ferieurs  a  ih+i~  ilt  ne  d6pendent  alors  que  des  elements  de  O^  qui  sont  in- 
ferieurs  a  ^+1  —  ilt  et  ils  en  sont  des  combinaisons  lineaires  a  coefficients 
entiers;  or  ces  derniers  ne  dependent  a  leurs  tours  que  des  nombres  ilt  iz,  .  .  .  ,  ih 
dont  ils  sont  des  combinaisons  lineaires  a  coefficients  entiers.  II  en  resulte 
que  les  elements  de  {0,  i^  +  *G1}  =  *G  qui  sont  inferieurs  a  ih+1  ne  dependent 
que  des  il9  i2,  ...,  ih,  dont  ils  sont  des  combinaisons  lineaires  a  coefficients 
entiers. 

Etant  donn6  le  semi-groupe  canonique 

*#  =  {Q,iltiit...,ir,i  +  ®v}     (v  =  (»1,*a»»-»V.i»V-i  +  "))» 
il  n'existe  qu'un  nombre  fini  de  semi-groupes  g  tels  que  *g  =  *£r.  Soit  en  effet 


un  tel  semi-groupe.  Solent^,  j2,  "••>jn  ceux  des  entiers  JDJ2,  •••»/s>  •••  °iui 
sont  inferieur  a  ir+l  =  ir_1~f-2^.  ir_1  et  *r-i  +  v  etant  des  combinaisons 
lineaires  a  coefficients  entiers  des  jlt  jz,  ...,  jn,  le  p.g.c.d.  de  ces  nombres 
est  v.  Or  a  chaque  systeme  d'entiers  positifs  inferieurs  a  ir_±  -f  2v  dont  le 
p.g.c.d.  est  v,  on  peut  associer  un  multiple  jv  de  v  tel  que  tout  semi-groupe 
d'entiers  non  negatifs  contenant  le  systeme,  contienne  tous  les  multiples 
de  v  superieurs  a  jv.  Soit  Lv  le  plus  grand  des  entiers  jv  qui  sont  ainsi 
associes  aux  systemes  de  multiples  positifs  de  v  inferieurs  a  ir_l  -\-  2v.  Les 
semi-groupes  g  pour  lesquels  on  a  *g  —  *G  contiennent  alors  tous  les  mul- 
tiples de  v  qui  depassent  Lv  et  ne  different  entre  eux  que  par  leurs  elements 
qui  sont  inferieurs  a  Lv. 

TH^OE^JME  1.  La  partie  commune  de  tous  les  semi-groupes  g  tels  que 
*g  =  *G,  est  un  semi-groupe  gx  tel  que  *gx  —  *G. 

Demonstration.  Soib  g  un  semi-groupe  tel  que  Ton  ait  *g  =  *6r  et  tel 
qu'aucun  sous  semi-groupe  de  g  ne  possede  cette  propriete;  nous  allons 
montrer  que  g  =  gx.  Soit  i  le  plus  petit  element  de  g  qui  n'est  pas  contenu 
dans  gx.  Soient  i0  =  0,  ilt  iz,  .  .  .  ,  ih  les  elements  de  g  et  de  gx  qui  sont  inferieurs 
at.  i  n  'etant  pas  contenu  dans  gx,  le  nombre  i  n'est  pas  de  la  forme 


avec  ax,  a2,  .  .  .  ,  ah  entiers  non  negatifs.  D'autre  part  gx  etant  la  partie  com- 
mune des  semi-groupes  dont  la  fermeture  canonique  est  *6r,  il  existe  un 
semi-groupe  g'  tel  que  *g*  =  *G  qui  ne  contient  pas  le  nombre  i.  Les  elements 
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de  *G  —  *</  qui  sont  inferieurs  a  i  ne  dependant  que  des  entiers  ilt  t2,  ...,  ih, 
le  semi-groupe  g"  obtenu  en  suprimant  dans  g'  tous  les  entiers  positifs 
inferieurs  a  i  autres  que  ilt  i2t  ...,ih  est  encore  tel  que  "g"  =  *(3.  II  en  resulte 
que  les  elements  de  *6r  qui  sont  inferieurs  ou  egaux  a  i  ne  dependent  que  des 
nombres  ilt  *2,  •-  •  »  H;  puisque  g"  ne  contient  pas  le  nombre  i.  Done  la  ferme- 
ture  du  sous  semi-groupe  de  g  obtenu  en  y  suprimant  le  nombre  i  est  encore 
6gal  a  *6?.  Contrairement  au  choix  de  g.  On  a  done  gx*=  get  par  consequent 

vx-*0. 

Le  sous  semi-groupe  gx  defini  dans  1'enconce  du  theoreme  1  s'appelle  le 
sous  semi-groupe  caracteristique  de  tous  les  g  tels  que  "g  =  *Q.  II  est  clair 
que  le  semi-groupe  gx  est  tel  que  tout  sous  semi-groupe  propre  de  g  ait  une 
fermeture  canonique  distinct  de  *gx  =  *Q.  Inversement  si  gx  est  tel  que  pour 
tout  sous  semi-groupe  propre  g'  de  gx  on  ait  V  =t=  *gx,  gx  est  son  propre  sous 
semi-groupe  caracteristique. 

gx  =  (0,*!,^,  •••j*r-i»V»  •••}  etant  le  sous  semi-groupe  caracteristique  de 
g,  considerons  les  entiers  ;&,  #a,  ...,  ^  definis  de  la  maniere  suivante: 
Xi  —  *i  J  ^2  est  1°  Pms  petit  de  ceux  des  entiers  i1?  *2,  .  .  .  ,  ir,  .  .  .  qui  ne  soient  pas 
de  la  forme  a^Xi  avec  <*i>  entier  non  negatif  ;  ^3  est  le  plus  petit  de  ceux  des 
entiers  ilt  i2,  ...,  *r,  ...  qui  ne  soient  pas  de  la  forme  &iXi  +  <x>2X*  avec  alf  az 
entiers  non  negatifs;  enfin  %i,  X*>  •->  X*  etant  definis  xn+i  est  le  P^s  petit 
de  ceux  des  entiers  ils  i2,  .  .  .  ,  ir,  .  .  .  qui  ne  soient  pas  de  la  forme 


avec  alt  aa,  ...,  an  entiers  non  negatifs.   Les  nombres  Xi>  Xz>  -••»  iV/i  ainsi 
definis  s'appellent  les  caracteres  de  ^. 

THEOREME  2.  yx  <  y2  <  .  .  .  <  yl  etant  un  ensemble  d'  entiers  positifs,  I'  ensemble 
des  caracteres  du  semi-groupe  g  des  entiers  de  la  forme 


avec  al5  a2,  .  .  .  ,  az  entiers  non  negatifs,  est  contenu  dans  I'  ensemble  yls  y2,  .  .  .  ,  yz. 
Demonstration.  Soit  ^  le  plus  petit  des  caracteres  #1,  #8»  •••»  ^/i  de  g  qui 
ne  soient  pas  contenus  dans  1'ensemble  yl9  ya,  ...,  y,.  D'apres  la  definition 
de  gx,  Xi  est  de  la  forme  a1yl  +  a2y2  +  .  .  .  +  cnvyv  avec  ax,  a2,  -  .  .  ,  ar  entiers  non 
negatifs;  ylf  y2,  ...,  yr  etant  ceux  des  entiers  yl9  ya,  ...,  y,  qui  sont  inferieurs 
a  %t-  7i»  72>  •••»  7i  etant  des  elements  de  la  fermeture  canonique  de  ^,  tout 
semi-groupe  canonique  contenant  #lt  ^2,  .  .  .  ,  ^^j  contient  aussi  yx,  y2,  .  .  .  ,  y  ,. 
Ceci  impliquerait  que  la  fermeture  canonique  du  semi-groupe  des  com- 
binaisons  lineaires  a  coefficients  entiers  non  negatifs  de  Xi>  X&  •••»  Xt-i> 
Xi+i>  -  •  -  »  Xh  contient  ^,  et  il  en  r^sulterait  que  g  n'est  pas  un  semi  groupe 
caracteristique.  Done  1'ensemble  yx,  ya,  ...,  y,  contient  necessairement 
Fensemble  Xi,  X*  •»,;&• 
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TH^JOBBME  3.  g  etant  le  semi-groupe  des  combinaisons  lineaires  a  coefficients 
entiers  non  negatifs  de  0  <  yl  <  ya  <  .  .  .  <  yz,  les  entiers 

"i>  *t>  •••>  VN-K  "AT-I>  v 
tels  que  Von  ait 


se  deduisent  de  ylt  y2,  ...,  yl  par  Valgorithme  quasi-jacobien  de  Du  Val* 
Les  entiers  vlt  t>2>  •  •  •  >  VN-\>  v  yfigurent  comme  diviseurs,  tandis  que  les  quotients 
partiels  representent  les  nombres  de  fois  que  chacun  des  diviseurs  est  repete 
dans  la  suite  vl5  v2,  ...,  vN_^9  v-  R&ciproquement  si  les  nombres 


se  deduisent  de  y1}  y2,  ...,  yl  par  V  algorithm?,  quasi-jacobien  de  Du  Val,  les 
quotients  partiels  etant  les  nombres  defois  que  chacun  des  diviseurs  figure  dans 
la  suite  yl5  v2>  •••>  VN-\>  v->  la  fermeture  canonique  du  semi  groupe  des  entiers 
de  la  forme 


avec  04,  a2,  .-.,  az  entiers  non  negatifs,  est  *g. 

Demonstration,  v  6tant  le  plus  grand  commun  diviseur  des  elements  de  g, 
on  a  7j  ^  v.  Si  yj  =  v  le  semi-groupe  g  est  constitue  par  Fensemble  de  tous 
les  multiples  de  y±  =  v,  et  Ton  a  g  =  *g  =  {(&v}.  Dans  ces  conditions 
1'algorithme  se  termine  des  le  commencement.  Admettons  que  la  proposition 
ait  et6  etablie  pour  tout  ensemble  y[  <  y'2  <  .  .  .  <  y\>  pour  lequel  y^  <  yl5  et 
etablissons  le  pour  1'ensemble  yl5  ya>  ...,  yz.  Soit  y^  le  plus  petit  des  entiers 
7i»  72>  •  •  •>  Ji  ^u^  ne  8O^  Pas  divisible  par  ylt  Soit  ^  le  quotient  de  yi  par  yx  et 
considerons  le  semi-groupe  F  des  combinaisons  lineaires  a  coefficients 
entiers  non  negatifs  des  nombres  y;  —  ^y!  y<+1  —  qy^  ,  .  .  ,  yl  —  gyls  yj_.  Le  semi- 
groupe  "g  contient  visiblement  le  semi-groupe  {0,  yx,  2yx,  .  .  .  ,  qy±  +  F}  qui 
contient  g.  On  a  done 

*<7  =  {0,y1,2y1,...,gy1  +  T}, 

c.a.d.  P!  =  yl5  v2  =  y1?  ...,  vq  =  y1? 


Les  entiers  yi  —  qy^  yi+1  —  qyly  .  .  .  ,  yl  —  qyi}  yx  etant  les  restes  de  la  (i  —  1  )-ieme 
division  de  Talgorithme  appliqu6  aux  nombres  yly  y2,  ...,  yt,  il  suffit  de 
montrer  que  les  entiers  vq+l,  vq+2,  ...,  ^_i,  v  s'obtiennent  en  appliquant 
Falgorithme  aux  entiers  y^-^y^  yi+i-^yi,  ...,  7i-qyi,  7r  Or  y^-qy^ 
etant  inferieur  a  ylt  ceci  a  ete  admis  comme  etabli.  R6ciproquement,  si  les 

*  Du  Val,  loc.  cit. 
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nombres  J>15  v2,  .  .  .  ,  VN-I^  v  se  deduisent  de  y^  y2,  .  .  .  ,  y\  par  1'algorithme  quasi  - 
jacobien  de  Du  Val,  la  fermeture  canonique  du  semi-groupe  des  com- 
binaisons  Iin6aires  a  coefficients  entiers  non  negatifs  de  yr,  ya,  ...,  yl  est 


d'apres  la  proposition  directe  que  Ton  vient  d'etablir. 

Xi>  Xa>  ••'»  Xh  etant  les  caracteres  de  g,  le  semi  groupe  des  combinaisons 
lineaires  a  coefficients  entiers  non  negatif  de  ^lf  ^2,  .  .  .  ,  Xn  es^  ^e  sous  semi- 
groupe  caracteristique  gx  de  gr.  II  en  resulte,  d'apres  les  theoremes  3  et  2, 
que  les  entiers  vltv2t...t  VN_^  v  se  deduisent  des  caracteres  de  g  par  1'algorithme 
quasi-  jacobien  de  Du  Val,  et  que  tous  les  systemes  d'entiers  non  negatifs 
7i>  7z>  •••>  7z>  pour  lesquels  1'algorithme  fournit  le  meme  resultat,  se 
deduisent  du  systeme  des  caracteres  de  g  en  lui  adjoignant  des  nombres 
arbitrairement  choisis  dans  *gr. 

6.  Considerons  main  tenant  un  anneau  H  et  sa  fermeture  canonique  *H. 
L'anneau  H  etant  de  la  forme 


sa  fermeture  canonique  *H  peut  se  construire  de  la  maniere  suivante: 
Designons  par  H±  1'  anneau 


oil  la  sommation  est  etendue  a  tous  les  systemes  d'exposants  entiers  non 
negatifs  aa,  as,  ...,  ah_l  tels  que  a2(i2-i1)4-as(i8-i1)4-  ...  +aft_1(iA_1-t1) 
soit  inferieur  a  ih  —  i±.  La  fermeture  canonique  *H  de  H  contient  visiblement 


qui  contient  H  et  Ton  a  *H  =  k  +  *H-LSil,  en  designant  par  *Hlt  la  fermeture 
canonique  de  H^  D'une  maniere  generate  Ht  etant  defini,  designons  par 
Hi+1  1'  anneau  deduit  de  Ht  de  la  meme  maniere  que  H^  est  deduit  de  H  . 
II  est  clair  que  pour  N  suffisamment  grand  HN  est  identique  a  k[T].  Soit 
Ti+±  un  Element  d'ordre  positif  le  plus  petit  de  Ht.  On  a  alors  visiblement 


*H  =  k  +  fcZl  4-  *H2TiTt  (avec  T  =  St) 


Remarque,  n  etant  un  entier  quelconque,  Tanneau  k  +  H^S^  modP  ne 
depend  que  de  H  mod  tn.  Pour  le  montrer  il  suffit  d'observer  que  H^  mod  tn~^ 


268  CAHIT  ABF  [15  Nov. 

ne  depend  que  de  H  mod  tn.  De  meme  1'anneau  k  ~f-  Hz  Tz  mod  tn~^  ne  depend 
que  de  JE^  modJn-'i.  L'anneau  k  +  kT^  +  H^T^  modP  ne  depend  done 
que  de  H  mod  tn.  En  continuant  de  cette  maniere  on  observe  finalement  que 

*H  =  k  +  kT1  +  kT1Tz+..,+kT1T^...TN__l-^HNT1T2...TN    modtn 
ne  depend  que  de  H  modtn. 

TH^OB^ME  AUXILIAIRE  5.  Si  H  modtn  est  identique  &  *H  mod  tn,  V  ensemble 
*H  mod  tn+l  est  identique  a  Vun  des  ensembles 

k  +  kS^  +  kS^+.-.+kSi^  +  lft^S^modLt"**     (•,<»+!). 

Demonstration.  L'ensemble  *H  modP  etant  identique  a  H  modP, 
1'  ensemble  *H  modi71*1,  qui  contient  1'ensemble  H  modi7**1,  est  constitue 
par  les  Elements  de  la  forme 


ou  S  est  un  Element  de  H,  *Sn  un  element  fixe  d'ordre  n  de  "H  ,  et  a  un  element 
de  k.  Done  tout  anneau  H'  modin+1,  contenant  1'anneau  H  modi71"1"1,  est 
identique  a  H  modP4'1,  s'il  est  contenu  dans  *H  niodin+1  sans  lui  etre 
identique.  Consideroiis  maiiitcnant  Tanneau 


qui  contient  H  modtn+1  et  qui  est  contenu  dans  *H  modin+1.  D'apres  ce 
que  nous  venons  de  remarquer  1'anneau  k  +  S^IfJ  modP+1  est  identique 
a  Tun  des  deux  anneaux 

"H  mod  tn+l,     H  mod  tn+l. 

S'il  n'est  pas  identique  au  premier  on  a  [/J  =  liJS^  modtn+1~ii.  Comme 
*[/f  J  mod  tn+1~^  ne  depend  que  de  [7f  J  mod  tn+l~^,  les  ensembles 

cr 

*[J^]  mod  «»+i-*i,     Jb  4-  —5  *[/<  J  mod  i^1-*! 

seront  alors  identiques,  puisque  ItJS^  est  Tensemble  des  elements  d'ordre 
positifs  de  I^JS^.  II  en  resulte  que  *H  modtn+l  est  identique  a  1'un  des  deux 
anneaux 

k  +  fljjlj  J  mod  P+1,     A;  +  M^  +  *[/,  J  ^ia  mod  ^+r. 

Si  *jff  modfn+1  n'est  identique  ni  k  fc  +  ^Jjy  modtn+1  ni  a 

mod  «*+\ 


ces  deux  anneaux  sont  identiques  a  H  modtw+1.  Dans  ces  conditions  on  aura 
[7t-J  s  IiJSit  mod  tn+i--i*t  dont  on  peut  deduire  Fidentit6  des  deux  ensembles 

*[/,  J 
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*H  modP+1  est  done  identique  &  Tun  des  ensembles 

*  +  [JiJ  Sit  mod  tn+\     k  +  kSit  +  [^ J  8it  mod  tn+\ 
k  +  kS^  +  kSiz  +  *[/,J  8t9  mod  P+1. 

En  continuant  de  cette  maniere  on  montre  que  *H  mod£n+1  est  identique 
a  1'un  des  ensembles 

fc  +  KJ^  mod«n+1, 

k  +  kStl  +  [/<  J  Sit  mod  t«+\ 


k  +  k8tl  +  kSi%  +...  +  [jy  Sit  mod  P+1, 


...+kSit  +  *[^+J  fy  +i    mod  *"+*. 

Or  pour  iz+1  ^  n  +  I  ,  le  dernier  de  ces  ensembles  est  //  mod  <n+1.  *,ff  mod  tn+l 
est  done  identique  a  1'un  des  ensembles 


avec 

JLI?  JT2,  ...,  Xn  etant  des  series  entieres  d'ordres  positifs  en  t,  nous  desig- 
nerons  par  k[X1}  Xz,  .  .  .,  Xn]  1'anneau  forme  des  series  de  la  forme 


avec  &j1jz...jntk,  la  sommation  etant  etendue  a  tous  les  systemes  d'entiers 
Ji>  J2>  •  -  •  >  Jn  non  negatifs. 


THEOREME  AUXILIAIRE  6.  i>e5  elements  Yl}Y2,  ...,YV,  ...  de  *#  eto^  choisis 
de  maniere  que  w(Yj)  soil  le  plus  petit  element  de  W(*H)  non  contenu  dans 
W(k[YltY^  ...,!}_!]),  si  les  elements  Y{,  Y'2,  ...,  Y'v_lt  sont  respectivement 
congrus  a  Y19  Y2,  ...,  Yv^l  mod^F^,  le  plus  petit  element  de  W(*H)  non  contenu 
dans  W(k[Y'lt  FJ,  ...,  F^J 

Demonstration.    Les  anneaux 


*#  mod  V*Tv\     k[Yl9  Fa,  .  .  .  ,  r^.J  mod  ^Fv),     jfc[  7J,  .  .  .  ,  F^J  mod  ««<^) 

etant  visiblement  identiques  il  suffit  de  montrer  que  k[Y[,  Y^,...,  F£_ 
ne  contient  pas  d'616ment  d'ordre  w(Yv).  Tout  element  de  k[Y'lt  Y'2,  ...,  FJ_ 
etant  de  la  forme 


oil  P(FJ,  FJ,  ..-,  F^_t)  est  un  polynome  a  coefficients  dans  k,  il  suffit  de 
montrer  que  w(P(  FJ,  F^,  .  .  .,  F^))  ne  peut  etre  egal  a  w(Yv).  Si  le  polynome 
P(Y(,  Fg,  ...,  F^j)  contient  un  terme  constant,  FJ,  Fg>  •••»  ^v-i  etant  tous 
d'ordres  positifs,  on  aura  w(P(Y(9  Y^  ...,  Y'v_t))  =  04=u>(FF).  Si 
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contient  des  termes  de  degr6s  1  sans  contenir  de  terme  constant,  on  peut 
Tecrire  sous  la  forme 


i,  FJ,  ...,  FJ.J+^FJ  +  P.fFi,  T't,  ...,  FU) 

avec  /?=i=0;  P2(Fj[,  Y'2,  •••»  Y'v-i)  etant  la  somme  des  termes  de  degres  non 
nuls  par  rapport  a  P  ensemble  FJ,  FJ+1,  ...,  Tfv_^  sauf  le  terme  flY'j. 
w(P2(Yfl9  Y'z,  ...,  Y'V_J)  est  alors  superieur  a  w(Y'j)  qui  est  par  definition 
different  de  1'ordre  de 


On  a  done 

«KP<FI,  r;,  ....  FUH  = 

Si  enfin  P(FJ,  Y'z,  ...,  F^)  lie  contient  aucun  terme  de  degr6  1  ni  de  degre 
zero,  on  peut  ecrire 

P(Fi,  F;,  ....  F^sPfF^F,,  ....F^)  mod^W1 
T5L,  F2,  .  .  .,  I^_1))  etant  different  de  w(Ylf)  il  en  est  de  meme  de 


THEOREME  ATTXILIAIRE  7.  Fl5  F2,  ...,  Yv_l3  Yv  et  FJ,  FJ,  ...,  Y'v_^  ayant  Us 
mimes  significations  que  dans  Venonce  du  theoreme  auxiliaire  6,  si  lafermetnre 
canonique  de,  k\Y^  F2,  ...,1^/_1]  ne  contient  pas  d*  element  d'ordre  w(Yv),  il  en 
est  de  meme  defermeture  canonique  de  k[Y{,  FJ,  ...,  F^_J. 

Demonstration.  Soient  iQ  =  0,  ilt  i%,  ...,  i^  ...  les  orders  des  elements  de 


'v_^\  ecrits  dans  Tordre  croissant  et  soit  /^  Tensemble  des 
elements  de  k[Y{,  Y'2,  ...,  F^_t]  dont  les  ordres  ne  sont  pas  inferieurs  a  i^ 
Designons  par  S'it  un  Element  d'ordre  il  de  k[Y[,  Y'2,  ...,  FJ_J,  et  par  §'  la 
fermeture  canonique  de  k[Y[,  Y'2,  ...,  Fy_J.  Les  anneaux 


*JBT  mod^Fv),     ©'  modf^^),     k[Y[,  FJ,  ...,  Fi_ 

etant  identiques,  d'apres  le  theoreme  auxiliaire  5  1'anneau  §'  modtW7(F>')+1 
est  identique  a  1'un  des  anneaux 


avec  it<w(Yv).  Soit  /£  le  plus  petit  des  entiers  I  pour  lesquels  cette  identite 
aHeu.  Si/t  =  0,  £'  mod  ««<nH-i  est  identique  a  Jfc[Fi,  FJ,  ...,  F^Jmodf1^1^^1 
qui  ne  contient  pas  d'element  d'ordre  ^(Fj,).  Supposons  done  que  fi  est 
positif.  Pour  montrer  que  §'  ne  contient  pas  d'el6ment  d'ordre  w(Yv),  il 
suffit  de  montrer  que  [/^-  ]  ne  contient  pas  d'element  d'ordre  w(Yv)—ifi. 
Soient  7€  et  *It  les  ensembles  des  elements  d'ordres  non  inferieurs  a  i^  de 
k[Ylt  .  .  .,  i^_J  et  de  *jfif.  Les  anneaux 

*H  mod  ^^>,     ^[F1?  F2,  .  .  .  ,  F^J  mod  e^Fv),     A;[  FJ,  FJ,  .  .  .  ,  F^J  mod  V**J 
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etant  identiques,  il  en  est  de  meme  des  ensembles 


Sip  mod  &**»>-**, 
oh  St   est  une  element  de  k[Ylt  F2,  ...,  I^J,  tel  que  Ton  ait 

8t  =  8'i      modJ™W. 
*/*        l/f 

II  en  resulte  qu'on  peut  associer  a  tout  element  Z'  de  I^JS^  un  element  Z 
de  1^  //Sj   de  maniere  que  Ton  ait 

Z  =  £' 


Considerons  en  particulier  un  ensemble  d'elements    JZrJ_,   ZJ,    ...,  Z'p  de 
7J  //SJ   choisis  de  la  maniere  suivante: 

(1)  Z'i  est  un  element  d'ordre  positif  le  plus  petit  de  /J  /8rt  , 

(2)  Z{,  Z2,  ...,  Z'j^  ^tant  choisis,  on  choisit  Z'$  de  maniere  que  w(Z'j)  soit 
le  plus  petit  element  positif  de  JP(/J  /^  )  qui  ne  soit  pas  contenu  dans 

*      * 


(3)  ^(Z'p)<w(rv)-i/t-f  1  et  tout  element  de  W^/Si)  inferieur  a 
w(Yv)-i^  I  soit  contenu  dans  W(k[Z'l9  Z'2,  ...,  Z'p]). 

k[Y{9  Y'z,  ...,  Fi_J  mod^F">+1  etant  distinct  de  £>'  mod^^H-1  tandis  que 
k[Y{,  Fg,...,  FJ_J  mod^^  est  identique  a  $'  mod^F">,  1'anneau 
k[Y[t  Y'z,  ...,  Fy_J  ne  peut  contenir  d'elements  d'ordres  w(Yv).  II  en  resulte 
que  les  nombres  w(Z{),  w(Z'2),  ...,  w(Z'p)  sont  inferieurs  a  1^(1^)  —  ^^.  Des 
conditions  imposees  au  choix  des  Z(,  Z'2,  ...,  Z'p  r6sulte  en  outre  Fidentite 
des  anneaux 


k[Z[,  Zi,...,  Z'p] 

il  suffit  done  de  montrer  que  k[Z'lf  Z'2)  ...,  Z'p]  ne  contient  pas  d'element 
d'ordre  w(Yv)  —  ift.  Or  soient  Zlt  Z2,  ...,  Zp  les  elements  de  /t-  /^-  tels  que 
1'on  ait  ^sZ,  mod-^FvMAI  ^  =  1?2)  jp) 

La  fermeture  canonique  de  k[YltY2t  ...,Fl,_1]  ne  contenant  aucun  element 
d'ordre  w(Yv),  I'anneau  k[Zlt  Z2,  ...,  Zp]  ne  contient  aucun  element  d'ordre 
w(Yv)-iJl.  Les  elements  Zt,  Zz,  ...,  Zp,  Zp+1  =  Yvl8i  de  [^  ]  et  les 
Z'It  Z'2,  ...,  ftp  remplissent  done  les  conditions  de  Fenonce  du  theoreme 
auxiliaire  6  vis  a  vis  de  I'anneau  canonique  [*/$  ].  L'anneau  k[Z[,  Z'2,  .  .  .,  Z'p] 
ne  peut  done  contenir  d'elements  d'ordres  w(Zp+l)  =  w(Yv)  —  ift. 

Considerons  main  tenant  un  ensemble  d'elements  Xlt  X2,  ...,  Xm  de  *H 
choisis  de  la  maniere  suivante:  X±  est  un  element  d'ordre  positif  le  plus 
petit  de  *H\  Xlt  X2,  ...,  X^-^  etant  choisis,  Xl  est  un  Element  de  *H  tel  que 
soit  le  plus  petit  element  de  W(*H)  qui  ne  soit  pas  contenu  dans 
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)*  ou  §z-i  designe  la  fermeture  canonique  de  k[X^  X2,  ...,  -X^i]. 
Les  elements  de  W(*H)  etant  des  combinaisons  lineaires  a  coefficients  entiers 
non  negatifs  d'un  nombre  fini  d'entre  eux,  les  elements  X1}  X2,  ...,  Xt,  ... 
ainsi  choisis  ne  peuvent  etre  qu'en  nombre  fini.  Un  tel  ensemble  d'elements 
(Xl9  X2,  .  .  .,  Xm)  sera  appele  dans  ce  qui  suit  une  base  de  *H. 

TH^OR^MB  4.  (Xlt  X2,  .  ..,Xm)  etant  une  base  de  *#,  les  entiers 
w(Xl)>w(X2),  ...,w(Xm) 

ne  dependent  que  de  *H  et  Us  constituent  une  partie  des  caracteres  de  *H. 

Demontrons  d'abord  la  proposition  suivante  qui  facilitera  la  d6monstra- 
tion  de  ce  theoreme. 

THEOREME  ATJXILIAIBE  8.  ^  etant  la  fermeture  canonique  de  k[Xlt  X2,  .  .  .  ,  X^ 
ou  Xlt  JC2,  .  .  .  ,  Xm  est  une  base  de  *H,  onpeut  choisir  les  elements  Y^  F2,  ...,YV,  ... 
de  §z  remplissant  les  conditions  de  Venonce  du  theoreme  auxiliaire  6  envisage 
pour  Vanneau  canonique  §z  (c.a.d.  w(Yj)  etant  le  plus  petit  element  de  W($j)j) 
non  contenu  dans  W(k[Yl}  Y2)  .  .  .  ,  ^J))  de  mani&re  que  la  suite  Yt,  Y"2>  •  •  •  ,  ^> 
contienne  V  ensemble  X1}  JT2J  ...,  Xj. 

Demonstration.  Pour  1=1,  on  a  visiblement  §j  =  k[X±]  et  Ton  peut 
poser  YI~  X±.  Supposons  que  la  proposition  ait  ete  demontree  pour  I  et 
demontrons  la  pour  1+1.  Soient  Yly  Yz,  ...,YV  ceux  des  elements  choisis  de 
^>?  dont  les  ordres  sont  plus  petits  que  w(Xl+1).  Les  elements  de  W(^t)  qui 
sont  inferieurs  a  w(Xl+1)  sont  alofs  les  memes  que  ceux  de  ^(A;^,  F2,  .  .  .  ,  FJ). 
Le  plus  petit  element  de  W($Ql+1)  non  contenu  dans  W(^t)  6tant  w(XM), 
posons  Yv+l  =  Xl+1,  et  choisissons  Yv+2,  Yv+3,  ...  dans  §z+1  conformement  a 
1'enonce  du  theoreme  auxiliaire  6  envisage  pour  §j+1.  La  suite 


remplit  alors  les  conditions  de  1'enonce  de  la  proposition  que  nous  voulons 
demontrer  pour  §z+1. 

Demonstration  du  theoreme  4.  Soient  Xlt  X2,  ...,  Xm  et  X[,  X'z,  ...,  X'm' 
deux  bases  de  *H.  Si  les  entiers  w(Xl),  w(X2),  ...,  w(Xm)  et  les  entiers 
w(Xrl),  w(X'2),  ...,  w(X'm.)  n'etaient  pas  les  memes,  1'un  au  moins  des  entiers 
(w(Xi),w(X9),...,w(Xm),w(X{),w(X'2),...,w(X'm,))  n'appartient  qu'a  1'un 
des  ensembles  (w(Xl),w(X2)i  ...,  w(Xm))9  (w(X'1))  w(X'2),  ...,w(X'm,)).  Soit 
w(  X'l+1)  le  plus  petit  de  ces  entiers  qui  n'appartient  qu'a  Tun  de  ces  ensembles, 
et  considerons  les  fermetures  canoniques  §z,  §j  des  anneaux  k[Xlt  X2,  .  .  .  ,  Xt], 
k[X'lt  X2,  ...,X$\.  D'apres  les  choix  des  X^  Xj  les  anneaux  §z  mod  f^n) 
$  J  mod  ff^xi+i>  sont  respectivement  identiques  a  ^mod^^+i^  *Hm.odtwf'x^. 
w(Xi+1)  6tant  par  d6finition  superieur  a  w(X'i+l),  1'anneau  §z  devrait  contenir 
un  element  d'ordre  w(  X'i+1)  .  Or  soit  (F1}  F2,  .  .  .  ,  Yv,  .  .  .  )  un  ensemble  d'elements 
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de  Jpj  choisis  conformement  &  1'enonce  du  theoreme  auxiliaire  8  et  soient 
YI}  Y2,  .  .  .,  Yv,  ceux  des  elements  de  cet  ensemble  dont  les  ordres  sont  inferieurs 
a  w(X'l+l).  Les  anneaux 

,     £z  mod  t^xi^\     & 


etant  identiques,  il  existe  des  elements  Y{,  Y2,  ...,  y£  de  fe'i  tels  que 
rj  =  Yj  mod*«W+i)     (j  =  1,  2,  ...,  j>). 

La  fermeture  canonique  de  k[Y(,  Y2,  ...,  1^]  qui  est  contenu  dans  ^>|  ne 
peut  contenir  aucun  element  d'ordre  w>(XJ+1).  Done  la  fermeture  canonique 
de  k[Yly  !F^,  .  .  .  ,  Yv]  qui  n'est  autre  que  §z  (puisque  1'ensemble  (Flf  F2,  .  .  .  ,  Yv) 
contient  1'ensemble  (XltX2,  ...,XZ))  ne  contient  pas  d'  elements  d'ordre 
w(X'i+1)  (theoreme  auxiliaire  7).  Done  w(Xl+1)  est  egal  a  tf(XJ+1)  contraire- 
ment  a  1'hypothese. 

Le  fait  que  les  nombres  w(X1),  w(X2),  ...,  w(Xn)  constituent  une  partie 
des  caracteres  de  *H  se  demontre  comme  suit:  w(Xl)  etant  le  plus  petit 
element  de  W(*H)  on  a  w(X^  =  Xi-  Supposoiis  que  w(X{)  soit  le  plus  petit 
de  ceux  des  nombres  w(X-L),  w(X2),  .  .  .,  w(Xm)  qui  ne  sont  pas  des  caracteres 
de  *H.  w(X^  serait  alors  contenu  dans  la  fermeture  canonique  du  semi- 
groupe  engendre  par  les  elements  de  W(*H]  qui  sont  inferieurs  a  w(Xl}. 
Or  les  elements  de  W(*H)  qui  sont  inferieurs  a  w(Xt)  sont  contenus  dans 
^(^1-1)-  On  a  done  w(X{)  e  TF(§z_!)  contrairement  au  choix  des  Xj. 

Dans  ce  qui  suit  nous  allons  appeler  les  nombres 

w(XJ  =  *Xl,  w(Xz)  =  **,,  ...,  w(Xm)  =  'Xm 

les  caracteres  de  base  de  *H.  De  la  definition  d'une  base  de  *H  et  du  theoreme  4 
resulte  immediatement  que  tout  systeme  d'elements  *X^  *X2,  ,  ..,  *JCm  de  *H 
tels  que  w(*X±)  —  *%1?  w(*X2)  =  *^2,  ...,  w(*JTm)  =  *^m  constitue  une  base 
de*^. 

Un  ensemble  d'elements  Ylt  Yz,  ...,YvdeH  s'appelle  un  systeme  de  genera- 
teurs,  si  *H  est  la  fermeture  canonique  de  k\Y^  —  ^15  F2  —  ijz>  ...,YV  —  ijv]  ou 
r)^  5/2,  ...,  KIV  designent  les  termes  constants  de  Yl)  Y2,  ...,  Yv. 

X1}  X2,  .-,  Xm  etant  une  base  de  *H,  considerons  un  ensemble  d'elements 
Yl9  F2,  .  .  .  ,  Ym  choisis  de  la  maniere  suivante  : 


SEE.  2.     VOL.  50.     NO.  2396. 
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oti  &{  designe  la  fermeture  canonique  de  1c[X^X&  ...tX^\\  les  Elements 
Ylt  ya,  •••>  Ym  constituent  visiblement  un  systeme  de  generateurs  de  *H. 
Inversement  tout  systeme  de  generateurs  contient  un  ensemble  partiel 
choisi  de  cette  maniere.  En  effet  Tlt  Y2,  ...,YV  etant  un  systeme  de  genera- 
teurs de  *H,  designons  par  9/1,  i/a,  ...,  rjv  les  term.es  constant  de  Ylt  F2,  ...,YV. 
L'un  au  moins  des  entiers  w(Yl  —  7/jJ,  w(Y2  —  ^2),  ...,  w(Yv  —  yv)  est  alors  egal 
a  *Xi*  s°i*  w(Yi  —  Vi)  =  *Xi*  On  peut  done  poser  Xx  =  Fx  —  iyv  W(^j)  contenant 
tous  les  elements  de  W(*H)  qui  sont  inferieurs  a  *#2,  on  peut  choisir  X^  e  $lt 
de  maniere  que  Ton  ait 


L'un  au  moms  des  entiers  ttf(T^  —  Jf^)  est  egal  a  *^2;  car  sinon  la  fermeture 
canonique  de  k[Xl)Y2  —  X'2,...,Yv  —  X/v]  qui  est  par  definition  identique  & 
*H  ne  contiendrait  pas  d'elements  d'ordre  *^2-  Soit  w(Y%  —  X'2)  —  *^2.  On 
peut  alors  poser  X2  =  Y2  —  X'2  et  ainsi  de  suite.  II  resulte  de  ces  considerations 
que  tout  systeme  de  generateurs  de  *H  contient  au  moins  ra  elements, 
m  etant  le  nombre  des  caracteres  de  base  de  *£T;  nous  1'appellerons  le  nombre 
de  dimension  de  *H. 


7  .    *H  =  k  +  kT±  +  kT±  T2  +  .  .  .  +  k[T]  7\  T2  .  .  .  TN^  etant  un  anneau  canon- 
ique, les  caracteres  ainsi  que  les  caracteres  de  base  des  anneaux 

...  +  k[T]  TMTM  ...  TN_, 

eres  de  *Hh  so 

par  ceux  de  *H.  Mais  il  n'en  est  plus  de  meme  des  caracteres  de  base  de  *Hh. 
Considerons  par  exemple  les  anneaux 

*H  =  k  +  &*"(1  +  t)  +  kt6v(l  +t)  +  ktlv(l  +  1)  +  k[t]  t*",) 
*H'  =  k  +  ktto  +  kt*v(I+t)  +  ktlv(I+t)  +  k[t]tSv.  J 

On  verifie  facilement  que  ces  anneaux  sont  tous  les  deux  canoniques  et  que 
leurs  caracteres  qui  sont  ceux  du  semi-groupe 

W(*H)  =  W(*H')  =  {0,4^,6v,7^8v+l,8v  +  2,8j/-f-3,  ...} 

sont  les  memes.  Ces  caracteres  sont  visiblement  4v,  6^,  Iv,  8^-f-l.  Con- 
struisons  maintenant  une  base  de  *H:  On  peut  evidemment  poser 
X1  =  ^"(1  +f);  klX^]  est  un  anneau  canonique  et  le  plus  petit  element  de 
W(*H)  non  contenu  dans  ^(Ajf-STJ)  est  61^;  on  peut  done  poser  X2  =  ^6"(1  -f  t). 
La  fermeture  canonique  de  k[Xt,  Xz]  est 


On  peut  done  choisir  X3  =  ^7"(1  -f  i)  comme  troisieme  element  de  base  de  *H. 
La  fermeture  canonique  de  k[Xv  X2,  X^]  etant  alors  *H,  les  caracteres  de 
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base  de  *H  sont  4i>,  6v,  lv.  On  observe  de  la  meme  maniere  que  les  elements 
X(  =  t*",  X'z  =  J6"(l+0,  X'B  =  £7"(1  +  0  constituent  une  base  de  *H'  '.  Les 
caracteres  de  base  de  *H  et  de  *H'  sont  done  les  memes.  Calculons  main- 
tenant  les  caracteres  de  base  des  anneaux 


v  +  k\f\ 
*H{  =  k  +  k 

Une  base  de  *H[,  est  constitue  par  t2v,  I3",  t^+l  tandis  que  les  elements 
t2v(  1  +  0,  £3"(  1+0  constituent  une  base  de  *H[,  puisque  la  fermeture  canonique 
de  fcp2^!  +0,$3"(1  +01  contient  1'  element 


dont  1'ordre  est  4y  +  1  .  Les  caracteres  de  base  de  *Hlt  sont  done  2v,  3p,  4y  +  1 
tandis  que  ceux  de  *H[  sont  2i^,  3v. 

Les  caracteres  de  base  des  anneaux  [7^  ]  =  *Hh  constituent  done  des  elements 
invariants  nouveaux  pour  *H. 

Les  considerations  qui  suiveiit,  permettent  de  determiner  successivement 
les  caracteres  de  bases  des  *Hh.  Considerons  un  element  quelconque  d'ordre 
positif  de  *H.  Soit  T  cet  element  et  soit  (Xl,X2,  ...,Xm)  une  base  de  *H. 
Designons  par  *Xi  ^e  P^us  petit  des  nombres 


tel  que  la  fermeture  canonique  de  k[XltX2,  ...,Xi_1,  T]  contienne  un  Element 
d'ordre  *Xt-  Les  elements  T,  TXlf  TX2,  ...,  TXt_l9  TXM,  ...,  TXm  con- 
stituent alors  une  base  dek  +  *HT  qui  est  canonique.  En  effet 


etant  la  fermeture  canonique  de  k^X^X^  ...,Xi_l, Xi+l,  ...,Xm,  T],  la 
fermeture  canonique  de  k[T,  TXly ...,  TX^  TXi+lt ...,  TXm]  contient 
visiblement  1'anneau 


Comme  k[Xlt  JC2,  ...,Xi_lLt  Xi+1,  ...,Xm,  T]  contient  un  element  d'ordre  *;&, 

•yr       ~ Y"     ~  ~ v"        --•  ~    "y       rjri  *  TT 

^j-2>  •••)-^*-j lJ"^^i+l>  •••>-"-w>       J   —     xz  . 


La  fermeture  canonique  de  k[T,  TX^  ...,  TXt_l9  TXi+L,  ...,  TXm]  est  done 
identique  a  ^     /p*jy 

qu'elle   contient;   puisque  1'anneau  k[T,  TX19 ...,  TX^,  TXM, ...,  TXm] 
est  lui  meme  contenu  dans  k+  T*H.  Done  pour  montrer  que  les 

T,  TX,,  ...,  TXt_,,  TXM,  ...,  TXm 

S2 
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constituent  une  base  de  k  +  T*H  il  suffit  de  montrer  que  les  fermetures 
canoniques  des  anneaux 

k[T,TX1,...,TXi'\  (l«j<»-l) 


k(T,TX1,...,TXi_1,TXM,...,TXh] 
sont  depourvus  d'elements  d'ordres  respectifs 

w(TXj+1),     w(TXi+1),     w(TXh+1). 
Or  ces  fermetures  sont  respectiveinent  identiques  a 


k  + 


ou  les  symboles  surlignes  designent  toujours  les  fermetures  canoniques  des 
anneaux  correspondants.  II  suffit  done  de  montrer  que  les  fermetures 
canoniques  des  anneaux  k[Xl9  X2,  ...,  Xp  T],  k[Xl7  X2,  ...,  X^_15  T], 
k[X1,X2,...,Xi_l,Xi+1,...,Xh,T]  ne  contienent  pas  d'elements  d'ordres 
respectifs  w(Xj+l),  w(X,i+1),  w(X7l+l).  Or  le  fait  que  la  fermeture  canonique 
de  k[X±,  Xz,  .  .  .  ,  X  p  T]  pour  j  <  i  —  1  ne  contient  pas  d'elements  d'ordre 
w(Xj+1)  est  implique  par  la  definition  de  i.  Si  1'anneau 


contenait  un  element  d'ordre  w(Xi+l)  ou  1'anneau 

k[Xl7  X2,  .  .  .  ,  Xt_^  Xi+l9  ...,  Xh,  T] 
un  element  d'ordre  w(Xh+l),  la  fermeture  canonique  de  1'un  des  anneaux 


k[X1,Xt,...,Xi_l9XM,...,Xh,XM,...,Xm,T],    pour    A<m-l, 
k[X1,X2,...9Xi_li,Xi+l,...,Xm_1,T],  pour    h  =  m-l, 

contiendrait  un  systeme  d'616ments  d'ordre  respectifs  *^1?  *^2>  ...,  *Xm  e^ 
par  consequent  une  base  de  *H.    Ce  qui  impliquerait  1'existence  d'un 
systeme  de  generateurs  de  *H  contenant  m  —  1  elements  settlement,  con- 
trairement  ^  ce  qui  a  ete  etabli  plus  haut  (voir  §  6). 
Les  caracteres  de  base  de  k  4-  T  *H  sont  done 

w(T),  w(T)  +  *Xi,  w(T)  +  *x*>  -,  w(^)  +  *A:<-i,  w(T)  +  ^i+1,  ...,  w(T)  +  *Xm- 
Comme  les  caracteres  de  base  de  k+  T*H  ne  dependent  pas  du  choix  des 
elements  Xly  X2,  ...,  Xm,  le  nombre  *Xi  ne  depend  que  de  T  et  de  *H  seule- 
ment.  Nous  aliens  le  designer  par  *Xi  =  *X(T,  *H). 
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D'une  maniere  analogue  les  caracteres  k  +  T  *H  se  d^duisent  de  ceux  de 
*H  par  les  expressions 

w(T)>  Xi  +  MZ1).  •••>  tt  +  wKZ').     P°ur    w(r)*Xi>  Xa>  •••>  &' 
et  par 

w(^),  Xi  + w(^).  ...,  A^.i  +  wfZ7),  ^+i  +  w(^),  •••»  tt  +  wKZ7).  P°ur  ^(^)  =  AV> 
en  designant  par  Xi>  Xz>  •-•»  Ai  les  caracteres  de  *//. 

En  partioulier  dans  le  cas  oh  tons  les  caracteres  de  *H  sont  aussi  ses 
caracteres  de  base,  tous  les  caracteres  de  k  +  T*H  seront  aussi  ses  caracteres 
de  base  si  w(T)  est  un  caractere  de  *H  ou  si  x(T,  *H)  est  infini. 

Remarque,  p  etant  un  Element  quelconque  de  W(*H),  on  peut  toujours 
choisir  1'element  T  d'ordre  w(T)  =  p  de  "".fir,  de  maniere  que  #(T,*//)  soit 
egal  a  Fun  quelconque  de  ceux  des  nombres  *;&,  *^2'  •••»  *Xw»  *^m+i  =  °°  ^ui 
d^passent  p,  pourvu  que  p  soit  different  des  nombres  *Xi-  Supposons  en 
effet  que  p  soit  distinct  des  nombres  *Xi  <  *Xz  <  •  -  •  <  *Xm  et  soit  *A&  ce^ui  ^e 
ces  derniers  pour  lequel  on  a  *Xi<P<*Xi+i'  ^i  Xx,  JC2'  •••>  -^m  est  une  ^ase 
de  *H,  la  fermeture  canonique  de  k[Xl9X2,  ...,'Xt]  contient,  par  definition 
des  elements  d'ordre  p.  Soit  T'  1'un  de  ces  elements,  et  posons  T  =  T'  +  -2T & 
(avec  A > Z,  JCm4.t  =  0).  Pour  l^j<h—l  les  ensembles 


etant  identique,  1'anneau  ^[^1?  X2,  ...,JT^,  T]  n'a  pas  d'elements  d'ordre 
w(XM)  =  *x/+i-  Pourj<Z,/9  =  w(T)  etant  plus  grand  que  *^y+i»  les  ensembles 


mod  r^i+i,     ^^T^TXjJ  mod  ** 


sont  identiques  et  par  consequent  k[Xl9  X2,  ...,Xit  T]  ne  contient  pas 
d'elements  d'ordre  w(XM).  Par  contre  1'anneau 


qui  contient  Ferment  T',  contient  aussi  1'element  T—T'  =  Xh.    On  a 

ono^r,*^)-*^- 
Considerons  maintenant  un  semi-groupe  canonique  quelconque 


Le  semi  groupe  *<3jv-i  =  {®^} 

a  visiblement  un  seul  caractere  qui  est  ^(izv~1)  =  v.  Les  caracteres  de 

*GW-i  =  {0,^-i  +  «"} 
sont  alors,  d'apres  la  regie  indiquee  plus  haut, 
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Les  caracteres  de  *GN_3  se  deduisent  des  precedents,  d'apres  la  meme  regie: 

I     pour     v 

%_2  +  *V-1  +  ^  J 


P°ur 


On  en  deduit  success!  vement,  en  appliquant  toujours  la  meme  regie, 
les  caracteres  (t-}     „,  ^ 

Al  '  A,2  >   •••>  A*i 

de  tous  les  semi-groupes     *6r^  =  (0,  vi+1  +  $f+i}. 
Posons  maintenant 


et  d'une  maniere  generate 


V/f+i1'  =  «•«  +  *^',  •  .  .  ,  Vtf  =  "i  +  *^,  pour  *A,  =  %  +  i  , 
oil  *&t-  est  Tun  quelconque  des  en  tiers  positifs  h  ^  %  +  1  pour  lesquels  on  a 
^  <  *xJ)  avec  *#(4+i  =  oo,  si  ^  4=*%i)J  •••»  ^4'  smon  *%%  est  cenii  des  nom- 
bres  VA  'Af,  -,  V^  q«i  est  egal  &  ^. 

De  la  remarque  precedente  et  des  considerations  qui  la  precedent  resulte 
immediatement  qu'on  peut  toujours  choisir  les  elements  Tt  e  *Z^  de  maniere 
que  les  caracteres  et  les  caracteres  de  base  des  anneaux 

•HM  =  k[T],  w(T)  =  v, 

*HN_2  =  k  +  *HN_iTN_»     w(TN^)  =  i/^j, 


*1?  =  *^0  =  jfc  +  ^21,  t^(^)  =  ^ 

soient  respectivement 

(Les  caracteres)  (Les  caracteres  de  bases) 


v(0)  v(0)  v(0).        *v(0)  *v(0)  *v(0) 

Al  ?  A2  >  ...,...,  A/.  >          Al  >  A2  >          ...,...,  X*^. 


1945]  UNE  INTERPRETATION  ALGEBRIQUE  279 

En  particulier  les  caracteres  de  base  de  *H  =  *H0  coincident  avec  see 
caracteres  si  et  seulement  si  Ton  a  choisi  *&$  =  %  4-  1  ,  chaque  fois  qu'on 
a  eu  a  en  faire  le  choix;  le  nombre  de  dimensions  de  *H  serait  alors  le  plus 
grand  des  dimensions  des  anneaux  canoniques  ayant  les  memes  caractdres. 

THEOREME  5.  Si  les  caracteres  be  base 


out  ete  construits  en  posant 

*A!%  —  le  P^us  petit  de  ceux  des  nombres  *$\  *X$\  •  •  •  >  *A!*(J+i 

gwi  we  sont  pas  inferieurs  a  vp 

le,  nombre,  de  dimensions  de  I'anneau  correspondant  *H  est  le  plus  petit  possible 
parmis  les  dimensions  des  anneaux  canoniques  ayant  les  memes  caracteres. 
Demonstration.  Soient 

tvUV-l).   tvUV-2)    tv(AT-2).          ,   tv(i-l)    tv(*-l)  tv(t-D. 

Al  »     Al          5     A2          ?   •••»     Al        J     A2        »   •••»     AT^_I  »    ••• 

un  autre  systeme  de  caracteres  de  bases,  deduits  des  memes  nombres  v^ 
Nous  avons  a  montrer  que  Ton  a  *lt  ^  %  (i  =  2V  —  1  ,  JV  —  2,  .  .  .  ,  0).  y  etant  un 
entier  quelconque,  designons  par  \(v)  le  nombre  de  ceux  des 

*v«)  *v«)          *v(*> 

Al  9     A2  ?    •••>     A*& 

qui  ne  sont  pas  inferieurs  a  v.  Soit  de  mSme  %(v)  le  nombre  de  ceux  des 
*Xi\  tA^>>  •  •  •  >  f/$J  ^^  ne  sont  pas  inferieurs  a  v.  Nous  aliens  d^montrer,  en 
meme  temps,  que  Ton  a 

*««•)  -Ti 

Les  6galites  ^^  =  *!,,_!  =  1, 

^-i  -  *  V-i  =  ^w-it")  -  V-i(")  =  0. 
etant  evidentes,  il  suffit  de  deduire  de 


les  inegalites       */<_!  ^  %_l}     \_^(v) 
Nous  distinguons  les  cas  suivants: 

(1)  ^  =  %     *xJ 

(2)  ^>*I,, 

(3)  ^>%,      ^inflni,     '^  infini; 

(4)  \>"lt, 

(5)  '*>% 
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(1)  f^4  etant  fini,  f^(^)  n'est  pas  mil.   tM^)-*^W  etant  inferieur  ou 
egal  a  \  —  *lt  =  0  le  nombre  %(^)  eat  non  nul.  Done  *x^h€  es^  ft*1*-  H  en 

resulte  que  Ton  a  t/  t/       *T       *7 

*i_i  =   *i  =   t<  =   *<_!- 

Montrons  qu'on  a  encore, 

Vi<")  -  Vi<")  <  Vi  -  Vi  (  =  °) 

pour  tous  les  v.  D'apres  les  formulas  de  recurrence 


il  est  clair  que  Ton  a 


Vi(")  -  f^(^  -  Vi)  -  1  ,  pour 

^i-i(^)  «  ^(^  -  vt),  Pour  ^  +  ^  <  i;, 

*^-i(^)  —  %»  Pour  ^  ^  ^<» 

%-i(v)  =  *W^  -  ^)  -  1  ,  pour  ^  <  v  ^  vi  +  V^, 

%-i(")  =  %(^  -  *<)»  P°ur     "i  +  *^i  <  I'- 

ll en  resulte  que,  pour 

v<^  +  min(^,V^)     etpour 
on  a  Vi(^)  -  Vi(^)  =  %(*  - 

Si  V^  <  Vt^,  on  aura  min  <*;$,  ^)  =  *^f  max  (^,  ^)  =  ^  et 

ViM  ~  Vi(")  =  %(*>  ~  »i)  -  %(*  ~  "*)  -  K  o 

(pour  vt  +  *^  <  p  <  i/i  +  ^tt)- 

Si  f^ft<  <  *X%hi>  vi  ^tant  inferieur  ou  6gal  a  f^^,  il  n'existe  aucun  nombre 
*^  compris  entre  f^  et  *^.  On  a  done  pour  ^  +  VtA, 


(2)  *li^*lit  ^hi  i11^11^  *A:*^  fi^-     Dans  ce  cas  on  aura  visiblement 
{_^  =  f^+l,  %_i  =  %>  et  par  consequent  ^t_i>%_i.    Des  formules  de 
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recurrence  qui  fournissent  les  nombres  tx^"~1)  et  *;$~"1>  resulte  en  outre 

que  Ton  a        tr     ,  x      t,   t  _ 
= 


t  _ 

=      +  *»  Pour 

%-i(v)  —  %(v  —  Vi)>  pour 

\-\(v)  =  %>  Pour 

ViM  =  %(*  -  ^i)  -  1  »  Pour     v,  <  v  ^  +  V4,» 

.    Vi(^)  =  %(^-^),  Pour     Vt  +  *^<v9 
dont  on  deduit  facilement  1'inegalite 


(3)  Pour  f^  ^  ^,  Vt^  infini,  *#$f^  infini,  il  est  clair  que  Ton  a  t^_1  =  %  -f  1  , 
%_!  =  %+  1  et  par  consequent  t^_1  ^  *^_!-  Des  formules  de  recurrence  qui 
fournissent  les  nombres  t;~~1),  *X~X)  resultent  en  outre 


Vi(^)  =      +l,  i(")  =  *+l»          Pour 

\-i(v)  =  +^("  -  Vi)>     %-i(v)  =  %("  ~  Vi),     pour     ^  <  v, 
d'oii  Ton  tire  \-\(v)  —  %-i(^)  ^  f^~i  —  %-i- 

(4)  \  >  \,  f%(t^  fini,  *^5g,  infini.  On  aura  alors 

Vi=^     Vi  =  «<+!, 

Vi(^)  =  *h  *k-i(v)  =  %  +  1,  Pour     i/  <  vt, 

Vi(^)  =  t^-^)-l»     ViM-^i'-Vi),     pour     v«  <?<!>!  +  *;$,, 
Vi(")  =  *W"  -  "<)»  Vi(^)  =  *W^  -  "*)>     Pour     ^  +  ^  <  v, 

et  par  consequent 
t?      •>  */ 

^i-\^   t'i-u 

Vi(»)  -  Vi(^)  =  Vi  -  Vi>  Pour     ^  ^  vit 

Vi(^)  -  Vit*')  =  **<("  ~  "<)  -  %("  -  Vt)  -  1  »     Pour     v,  <  v  *S  ^  +  Vtt- 

<r  t7  *7 

^    *<-l~   ^-1> 

fXt24  etant  fini  mais  superieur  ou  egal  pt  tandis  que  *^c*^  est  infini,  on  aura 

ViM==%(^-^)  =  °J     Pour     0^+^, 
et  par  consequent 

ViM  -  ViM  =  Vi(")  <  Vi("«  +  Vt^),     pour     vt  +  *X%,  <  ••, 
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(6)  %  >  *li}  ^  fini,  *#(4,  fini-  Vsms  ce  cas  les  inegalites 

Vi  ^  %     Vi(")  ~  Vi(")  <  Vi  -  Vi 

se  deduisent  de  ^^"Vi,  ^(y)  —  *^(^)  <  f^  —  *^i  exactement  de  la  meme 
maniere  que  dans  le  cas  (1). 

10  6  tant  le  nombre  des  caracteres  de 


*/0  le  nombre  des  caracteres  de  base  *^>,  *^(20),  .  .  .  deduits  de  *G  conformement 
a  1'enonce  du  theoreme  5,  nous  venons  de  voir  que  le  nombre  des  caracteres 
de  base  d'un  anneau  canonique  *flT,  tel  que  W(*H)  —  *$,  est  compris  entre 
*70  et  10.  Inversement  on  a 

THEOREME  6.  n  etant  un  entier  quelconque  compris  entre  *1Q  et  1Q  il  existe 
un  anneau  canonique  de  dimension  n  dont  les  caracteres  sont  ceux  de*G. 

Demonstration.  II  suffit  de  deduire  de  1'existence  d'un  anneau  canonique 
de  dimension  n  —  1  celle  d'un  anneau  canonique  de  dimension  n.  Supposons 
done  qu'il  existe  une  systeme  de  caractere  de  base 

tvCV-l).    tvOV-2)     tv(2V-2).          .    tv(0)     tv(0)  tv(0) 

Al  >     Al  >     A2  ?    •••>     Al  J     A2  J    •••?     Atfo 

deduits  de  *Q  suivant  les  regies  indiquees  plus  haut  et  que  1'on  a  fZ0  =  n  —  1. 
Le  nombre  fZ0  etant  plus  petit  que  Z0,  il  existe  des  entiers  i  pour  lesquels 
fA!%est  fini  sans  etre  6gal  a  j^;  soit/*  le  plus  petit  de  ces  entiers.  Noils  pouvons 
supposer  que  le  systeme  de  caracteres  de  base 


ait  ete  choisi  parmis  les  systemes  qui  remplissent  les  memes  conditions,  de 
telle  maniere  que  /*  soit  le  plus  grand  possible.  Cela  etant,  posons 


2, 


+7'       —  ^7        -L.  1  tv/(«-i)  _  t^-D    tv'(«-i)  _  t^X^-i)  t   '(/i-l)  _  tv(/»-D 

V-l  ~     */fr-l  +  X  3  ^1  ~     ^1         '     ^2  ^         —     A2         >   •  •  '  >     Xt  AM         —     Xt/iM     > 

tv'</»-D  _  y      .    tv(/*>      tvx>-D  _   tv(^-D 
—  - 


avec  fXt/^  =00.    L'ensemble   ^i^"1^   t^i(/*~1)»  •••>  ^X^~l}  es^  visiblement 
constitu6   par  1'ensemble    *tff'~'L\  ^^z~!\    •••»    ^17^   et   par   le   nombre 

^^i  =  ^+  ^V    Le  nombre  Vi  ne  Peut  Paa  ^re  ^al  a  ^t+r    Car 
sinon  on  aurait  *x*h~-l  =  °°>  *X*h~*l  ~  *X*h~+i  e^  ^e  systeme  correspondant 
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serait  eompos6  des  memes  nombres  que  le  systeme 


Ceci  nous  permetterait  alors  de  construire,  en  posant 

I)  tv'</»-4)  _   tv(/-4). 

J    "•'      A>;       4      ~      At/       4J 


un  systeme  de  caractere  de  base  f^i(JV-1);  ...;  f^i<0),  ^2^  •••>  f  A*t/^  remplissant 
les  memes  conditions  que  le  systeme  Vi^"1^  •••'»  t#?)>  ^^  •••>  f^  sauf  que 
les  f^tjj|  sont  infinis  ou  egaux  a  vi  pour  i  =  1,2-,  ...,/*  —1  et  /«.  Done  si 
tXtjJ,''  ost  le  premier  des  nombres  *;$$  q^i  n?est  ni  infini  ni  egal  a  v^.,  on 
aurait  fi'  >  p,,  contrairement  au  choix  du  systeme 

tvW-l).    tv(JV-2)     tv(JV-2).          .    tv(0)     tv(0)  tv(0) 

Al  >     Al  >     A2  >    •••>     Al  >     A2  >    •••>     Aty/ 

^^-i  etant  ainsi  different  de  *X*h7+'i  Qu*  es^  ^e  seu^  ^es  nombres  t^J(/t~1)  qui 
ne  soit  pas  £gal  a  un  nombre  tx^"~1)  on  peut  poser  f;$£lf  =  f^t$~^  et  con- 
siderer  Tensemble 


qui  est  alors  compose  des  nombres 

vr2),  *#-*,  -,  *<:? 

et  de  ^tft6^.1!  +  vn-\  =  ^u  +p/t  +  v/t-i-   ^n  montre  de  meme  que  ^_2  est 

distinct  de  +Xtvfi  +  Vi«  ce  ^ui  nous  Permet  de  Poser  *;$£?=  ^t^. 
En  continuant  de  cette  maniere  on  construit  finalement  le  systeme 

tY'(o>   tv'(o)  tv'<9) 

Al     >     A2     >   "•>     At?0 

qui  se  compose  de  ^0),  V20)»  ...,  f^ 

et  du  nombre  *x*h  +  VP  ~*"  v  n  -i  "*"  •  •  •  +  vv  ^n  a  done 


Le  tableau  ci-contre  montre  les  systemes  de  caract&res  de  base  qui 
correspondent  au  semi-groupe 

*Q  =  (0,  702,  1404,  1620,  1836,  2052,  2106,  2160,  2214,  2268, 

2322,  2340,  2358,  2376,  2383,  2390,  2394,  2397  +  (5$); 

« 
la  premiere  colonne  du  tableau  etant  en  meme  temps  le  systeme  des 

caracteres  de  *G. 
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1*"  colonne 

2!lme  colonne 

3ime  colonne 

4 

"a*  colonne 

58mec 

olonnt 

H 

1 

1 

1 

1 

1 

Hu 

3 

4 

3          4 

3 

4 

3 

4 

3 

4 

#15 

4 

7 

4          7 

4 

7 

4 

7 

4 

7 

#14 

7 

11 

7        11 

7 

11 

7 

11 

7 

11 

#13 

7 

18 

7        18 

7 

18 

7 

18 

7 

18 

18 

25 

18        25 

18 

25 

18 

25 

18 

25 

#11 

18 

43 

18        43 

18 

43 

18 

43 

18 

43 

#10 

18 

61 

18        61 

18 

61 

18 

61 

18 

61 

#0 

54 

72      115 

54        72 

115 

64 

72 

115 

54 

72 

54 

72 

#8 

54 

126      169 

54      126 

169 

54 

126 

169 

54 

126 

54 

126 

#7 

54 

180      223 

54      180 

223 

54 

180 

223 

54 

180 

54 

180 

#6 

54 

234      277 

54      234 

277 

54 

234 

277 

54 

234 

54 

234 

#5 

54 

288      331 

54      288 

331 

54 

288 

331 

54 

288 

54 

288 

#4 

216 

270      504      547 

216      270 

547 

216 

270 

504 

216 

270      504 

216 

270 

#3 

216 

486      720      763 

216      486 

763 

216 

486 

720 

216 

486      720 

216 

486 

#2 

216 

702      936      979 

216      702 

979 

216 

702 

936 

216 

702      936 

216 

702 

#1 

702 

918    1638    1681 

702      918 

1681 

702 

918 

1638 

702 

918    1638 

702 

918 

ff 

702 

1620    2340    2383 

702    1620 

2383 

702 

1620 

2340 

702 

1620    2340 

702 

1620 

Comme  exemples  d'anneaux 

H  dont  les  caracteres  sont 

702,  1620,  2340, 

2383  on  peut  citer  les  suivants: 

£f£702    £1620    £2340 

,  *2383] 

k[tm(l  +  t™)*,t 

L020(1  +  p)7j 

*2388( 

1+P)9] 

k[t™(l  +  t™)*,t 

1620(1-M115) 

7    £234< 

Hi-fi115)9] 

*P70i(l  +  '7)li,«1 

620(1-K7)30, 

/2340/ 
1         \ 

l-H7)44] 

)13  (i+e79)8,  #620(i+z7)3  (i+*79)7] 

dont  les  suites  des  caracteres  de  bases  sont  respectivement  donnes  par  les 
cinq  colonnes  du  tableau  ci-dessus. 

Remarquons  enfin  que  les  caracteres  de  *H  et  les  caracteres  de  base  de 
*H,  *Hlt  ...,  *HN-I  <lui  sont>  comme  nous  1'avons  vu,  des  invariants  de  *#, 
ne  constituent  pas  un  systeme  complet  d'invariants.  C'est  &  dire  on  peut 
construire  des  anneaux  canoniques  *H  et  *Hf  non  transformable  1'une  dans 
1'autre  par  une  substitution  de  la  forme 


de  maniere  que  les  caracteres  de  *H  et  de  *H'  ainsi  que  les  caracteres  de 
base  de  */7,  *Hlt  ...,  *HN_^  et  de  *H',  *H{,  ...,  *H'N_l  soient  respectivement 
les  memes.  Soient  par  exemple 


*H'  = 

avec  v  >  2.  Ces  anneaux  ont  les  memes  caracteres  qui  sont 
4v,  6y,  7»>, 
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Leurs  caracteres  de  base  sont  egalement  les  memes: 

4v,  6v,  7v. 
Les  anneaux  *H±,  *#i  etant  tous  les  deux  identiques  a 


ies  caracteres  de  base  de  *H[,  *H'2,  *H!3,  *//4  sont  respectivement  les  memes 
que  ceux  de  *Hlf  *H2,  *H3,  *#4.  Par  contre  il  n'existe  aucune  substitution 
de  la  forme 

(a)  t->t(ot0  +  alt  +  a2t*+...) 

qui  transforme  *H  en  *#'.  En  effet  une  telle  substitution  qui  doit  transformer 
*H  en  *H'  devrait  transformer*/^  en  *H(,  c.a.d.  enluimeme.  Or,  ensupposant 
que  2v  est  non  divisible  par  le  caracteristique  de  k,  les  substitutions  de  la 
forme  (a),  qui  transforment  1'anneau 


+  k[t]  t* 
en  lui  meme,  sont  de  la  forme 

t  ->  t(a0  +  av  tv  +  oc2v  t2v  +  oc2v+i  t2v+l 
dont  aucune  ne  transforme  Felement 


de  *H  en  un  element  du  meme  ordre  de  *H'  qui  est  de  la  forme 

£0(J4"  +  t^+l  +  J4"+2)  -f  ^(tQv  +  i6^1  +  J6"+2)  +  .  .  .  . 

8.    Considerons  maintenant  une  branche  alg6brique  passant  par  Torigine 
et  d^fini  par 

)f  ...,  Yn  «  yn(o, 


ou  Fif^),  T2(f  )3  .  .  .  ,  Tn(^)  sont  des  series  entieres  en  t,  dont  les  termes  constants 
sont  nuls.  Considerons  1'anneau  k\Yi(t)  ,  Yz(t)  ,  ...,Yn(t)~\.  Nous  pouvons 
admettre  que  cet  anneau  contient  des  elements  de  tous  les  ordres  qui 
depassent  un  nombre  suffisamment  grand  (theoreme  auxiliaire  2). 


THEOREME  7.  *H  etant  la  fermeture  canonique  de  k[Y^(t)  ,Y2(t)  ,  .  .  .  3Yn(t)} 
soit  W(*H)  =  {0,  ^1?  v±  H-  v2)  .  .  .  ,  ^  4-  ^2  +  •  •  •  +  ^jv-i  +  ®}-  ^e5  ordres  de  multi- 
plicites  des  points  successif  de  la  branche  I^(<), 


Demonstration.  Soit  ^(l'i(i))  le  plus  petit  des  nombres 
),  w(Y2(t)),  ..., 
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Le  point  O  =  (0,  0,  ...,  0)  est  alors  un  point  multiple  d'ordre  w(YL(t)). 
D'autre  part  il  est  clair  que  w(Y1(t))  =  v-±.  II  suffit  done  de  montrer  que  les 
ordres  de  multiplicite  des  points  successifs  (t  =  0)  de  la  branche* 


qu'on  deduit  de  Y-^t),  Y2(t),  ...,  Yn(t)  en  resolvent  le  point  O,  sont 

v2,  v9,  .-.,  VN-I,  1,  1,  .... 

Nous  transportons  Forigine  des  coordonnes  au  point  t  =  0  de  la  branehe 
Y[(t),  Y'2(t),  ...,  Y'n(t),  qui  devient  alors 

FiW-^i,  r;(0-^f  ...,  Y'n(t)-r)n 

011^,9/2,  ...,  7jn  designent  les  termes  constants  des  series  Y'^t),  Y'z(t),  ...,  Y'n(t). 
IVj  etant  Fid6al  de  A;[F1(f),Tr2(i),  ...,Yn(t)]  forme  par  ses  616ments  d'ordres 
superieurs  ou  egaux  a  vlt  il  est  evident  que 


Or  nous  savons  que  *H  =  k  +  Y^t)  [Jv  ] 

et  que  W([IV$  =  {0,  v2,  v2  +  1/3,  .  .  .  ,  v2  +  ^3  +  .  .  .  +  ^ 

Done  1'origine  est  un  point  multiple  d'ordre  y2,  pour  la  branche 

ri(0-?i,  Y'2(t)-K,  ...,  r;(«)-^n; 

autrement  dit  le  plus  petit  des  nombres 


est  v2.  On  acheve  la  demonstration  du  th^oreme  5  en  repetant  un  nombre 
quelconque  de  fois  ce  raisonnement. 

D'apres  le  theoreme  3  les  nombres  vlt  vz,  ...,  ^^-\>  ...  se  d^duisent  des 
caracteres  de  *H  exactement  de  la  meme  maniere  que  ces  nombres,  con- 
sideres  comme  ordres  de  multiplicites  de  la  branche,  se  deduisent  des 
caracteres  de  Du  Val  associes  a  la  branche  Y^t),  Y2(t),  Y3(t),  ...,  Yn(t).  Done 
les  caracteres  de  Du  Val  de  cette  branche  sont  les  memes  que  ceux  de 


II  est  evident  que  si  deux  branches 


passant  par  rorigine  peuvent  etre  transformees  Tune  dans  1'autre  par  une 
transformation  birationnelle  reguliere  a  1'origine,  les  anneaux 

),  ...,  Yn(t)],     kiZS),  Zs(t),  ...,  Zm(t)] 


*  Voir  P.  DuVal,  loc.  cit.et  J.  G.  Semple,  "Singularities  of  space  algebraic  curves", 
Proc.  London  Math.  Soc.  (2),  44  (1938),  149-174. 
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sont  identiques  ou,  d'une  maniere  plus  precise,  transformables  Tune  dans 
1'autre  par  une  substitution  de  la  forme  ^->^(a0  +  a1^+...))  (a04=0)  et 
r^ciproquement.  Nous  dirons  alors  que  ces  deux  branches  sont  reguliere- 
ment  equivalentes  entre  elles.  Pour  deux  branches  regulierements  equi- 
valentes,  les  anneaux 


,.  ..Jn(t)l    *H'  =  k[Z^),Z2(t),...,Zm(t)} 

peuvent  evidemment  etre  transformed  Fun  dans  1'autre  par  une  substitution 
de  la  forme  /->^(a0  +  a1^  +  ...)>  (<%04=0);  mais  de  1'identite  *H  =  *Hr  on  ne 
peut  pas  d^duire  1'identite  des 


Nous  dirons  que  les  deux  branches  donnes  sont  canoniquement  equi- 
valentes si  Ton  a  *H  =  *H'.  Deux  branches  regulierement  equivalentes 
sont  aussi  canoniquement  equivalentes  sans  que  1'inverse  soit  nec^ssaire- 
ment  vrai.  Les  caracteres  de  *H  ainsi  que  les  caracteres  de  base  de 
^H^  *H2)  .  .  .  ,  *##-!  sont  done  des  invariants  de  la  branche  Y^t),  Y2(t),  .  .  .  ,  Yn(t) 
pour  F^quivalence  canonique  et  par  consequent  pour  I'equivalence  regu- 
liere.  Remarquons  toutefois  que  les  caracteres  ainsi  que  les  caracteres  de 
base  de  */7,  *HV  *H2,  ...,  *HN^  ne  constituent  pas  un  systeme  complet 
d'invariants  ni  pour  I'equivalence  canonique  iii  pour  l'6quivalence  r6gu- 
lieres;  puisque  comme  nous  1'avons  vu  plus  haut  ces  caracteres  et  ces 
caracteres  de  base  ne  suffisent  pas  a  determiner  *H. 

Les  series  Y^t),  Y2(t),  ...,  Yn(t)  constituent  visiblement  un  systeme  de 
generateurs  de  *H  = 


A  la  fin  du  §  6  nous  avons  vu  comment  on  pouvait  construire  tous  les 
systemes  de  generateurs  de  *H  en  partant  de  ses  bases.  Nous  avons  vu  en 
particulier  que,  m  etant  le  nombre  de  dimension  de  *H,  c.a.d.  le  nombre 
de  ses  caracteres  de  base,  tout  systeme  de  generateurs  de  *H  contient  m 
elements  qui  constituent  a  eux  seuls  un  systeme  de  generateurs  de  *H.  Cela 
s'exprime  geom6triquement  en  disant  que  si  m  est  le  nombre  des  caracteres 
de  base  de  k[Yi(t),  Y2(t),  .  .  .,  Yn(t)]  1'une  des  projections  de  dimensions  m  de  la 
branche  F^),  Y2(t),  ...,  Yn(t)  est  son  equivalent  canonique  tandis  qu'  aucune 
projection  de  dimensions  inferieures  a  m  n'est  canoniquement  Equivalentes 
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NOTE  ON  CAHIT  ARF'S  "UNE  INTERPRETATION  ALGfiBRIQUE 

DE  LA  SUITE  DES  ORDRES  DE  MULTIPLICITY 

D'UNE  BRANCHE  ALG&BRIQUE"* 

By  PATRICK  Du  VAL 

{Received  30  September  1946.  —  Read  14  November  1946] 

Cahit  ArFs  results  being  severely  algebraic  in  form,  their  geometrical 
meaning  may  not,  at  first  sight,  be  evident  to  all  geometers;  it  is  felt  there- 
fore that  a  word  of  explanation  may  not  be  out  of  place. 

A  branch  being  parametrized  as  in  §  8  of  C.A.,  the  orders  of  the  elements 
of  the  ring 


are  the  possible  intersection  numbers  of  all  hypersurfaces  with  the  branch; 
in  fact  the  intersection  number  of  the  hypersurface 


is  clearly  the  order  of  the  element 


of  the  ring.  If  the  ring  H  is  canonical,  theorem  7  shows  that  these  inter- 
section numbers  are  all  the  multiplicity  sums  of  the  branch.  This  accordingly 
is  the  characteristic  property  of  a  canonical  branch.  It  does  not,  as  one 
might  suppose,  follow  from  this  that  there  exist  hypersurfaces  passing 
simply  through  every  number  of  consecutive  points  of  the  branch;  for  the 


corresponding  to  an  ordinary  cusp 

Y1  =  t*,     Y2  =  t* 

is  clearly  canonical,  whereas  there  are  no  curves  passing  simply  through 
more  than  the  first  two  points  of  the  branch. 


Supra,  256-287.    Referred  to  as  C.A. 
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Thus  the  uniqueness  of  the  canonical  closure  of  a  ring  means  that  every 
branch  is  a  projection  of  sequence,  unique  save  for  a  transformation  regular 
at  the  origin. 

The  number  of  base  characters  of  the  branch  clearly  gives  the  minimum 
space  in  which  there  can  be  a  branch  canonically  equivalent  to  the  given  one. 

To  fix  our  ideas,  let  us  consider  the  ring  generated  by 


considered  by  C.A.  The  classical  theory  of  Enriques  shows  that  the  points 
Px,  P2,  .  .  .  consecutive  along  this  plane  branch  are  of  multiplicities 

4,  4,  2,  2,  2,  2,  1,  1,  ..., 

P4  and  P8  being  satellite  and  the  rest  free.  Since  typical  elements  of  orders 
0,  4,  8,  10,  12,  14,  16,  18,  20,  22,  24,  25,  26,  28,  29,  30,  32,  33,  34,  36 


we  can  write 

H  =  k[XiX^ 

=  k  +  kt*>  +  kt*  +  ktw(  1 


>  *"•!>  -^l-^2>  ^l'  -^1-^2'  -^1>  -^2       -^1>  -"-l-**-2 


which  is  the  same  thing  as 
H  =  k  +  Art4  +  A;e8  +  A:^10(  1  +  *5)  +  ^12  +  kt™(  1  +  f  5) 

+  ^20  +  kt**(  i  +  t&)  +  to**  +  to™  +  to**(  i  +  t*)  +  kt™  +  to29  +  fo30  +  k[t]  t 

we  have  thus 
-^-  =  k  +  kt*  +  to*(  1  +  ^ 


and,  as  the  ring  generated  by  this  contains  the  elements 

("rp"    I  -A  1>          I  ~y^~  1-^2 

Ai/  \Ai/ 

of  order  17,  23  respectively,  we  have 


8  +  to17  +  ta18  +  &MJ20. 
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By  exactly  the  same  method  we  see  that 


and  since  this  is  clearly  canonical  we  find  that  the  canonical  closure  of  H^  is 

*Hl  =  k  +  t*Hz  =  k  +  kt*  +  kt«(  1  +  *5)  -f-  kt*  +  ktl«  +  k[t]  t™, 
and  that  of  H  is 

*#  =  k  +  «**fli  =  k  +  kt*>  +  kts  +  ktw(  I  +  £5)  +  kt™  +  to1*  +  fcp]  *16, 

of  which,  as  we  expect,  the  characters  are,  4,  10,  17,  the  first,  third,  and 
seventh  multiplicity  sums  of  the  branch.  Since 


where  X3  =  t17,  X±  —  t19,  we  see  that  the  canonical  branch  of  which  the  given 
branch  is  a  projection  is  in  four  dimensions.  Any  projection  of  this  into  two 
dimensions  is  represented  by  a  ring  of  the  form  k\Ylt  FJ,  where  Ylt  Y2  belong 
to  H,  and  can  clearly  be  chosen  to  be  of  orders  4,  10  respectively,  i.e.  we 
may  take 

Yl  -  **  +  a1*17  +  aa*1R+  ...,     F2  =  *w  +  *1*  +  61*17  +  6a*18+...; 

for  different  values  of  the  coefficients  a1,«2>  •••>  ^i>  ^a>  •••>  ^hese  branches  are 
not  regularly  but  only  canonically  equivalent. 

An  interesting  feature  is  the  apparent  unimportance  of  the  term  t15  in 
the  canonical  ring,  whereas  of  course  this  is  of  fundamental  significance  in 
determining  the  characters  of  the  plane  branch.  In  fact  the  canonical  ring 

*H  =  k  +  kt*  +  kt*  +  to19(  I  +  15)  +  to19  +  to14  +  k[f]  «16 
clearly  has  the  same  characters  as 

*H'  =  k  +  kt*  -f  kt*  +  ktw  +  kt™  +  tou  +  k[t]  t16, 

which  is  also  canonical.  This  latter,  however,  cannot  be  generated  by  two 
elements;  in  fact  a  base  for  it  must  be  in  some  such  form  as 

X[  =  t*,     X'2  =  t™,     X'3  =  tll') 
thus  of  the  two  canonical  branches 


and  X[  =  t*,     ^2  =  «10,     X'3  =  t17,     Xt  =  t19, 

both  of  which  have  two  fourfold  followed  by  four  twofold  and  a  succession 
of  simple  points,  the  former  can  and  the  latter  cannot  be  projected  into 
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a  plane  branch  with  the  same  multiplicity  sequence.  In  fact  a  general  plane 
projection  of  the  latter  is  of  the  form 


/10  _i_  A/17 


or  in  terms  of 


t  =  T 

and        Y{  =  T*,     Y'2  =  r10  +  j^r12  +  £2r14  4-  J53 

which  by  Enriques's  theory  clearly  represents  a  branch  with  two  fourfold 
followed  by  not  four  but  five  twofold  points;  the  canonical  closure  of 
k[  Y[,  Fg]  is  in  fact  not  *H  '  but  its  canonical  subring 

k  +  kt*  +  kt8  +  H10(  1  +  at7)  +  kt12  +  kt1*  +  JWW  +  k[t]  t18, 

where  a  is  a  fixed  constant  depending  on  those  in  the  expansions  of  Y[,  Y'z. 
*H'  is,  however,  the  canonical  closure  of 


or  of  the  ring  representing  a  general  projection  of  the  branch  corresponding 
to  *Hf  into  three  dimensions.  This  canonical  branch  can  accordingly  be 
projected  into  three  but  not  into  two  dimensions  without  changing  its 
characters;  in  short,  whereas  the  two  canonical  branches  considered  both 
have  the  characters  4,  10,  17,  the  base  characters  of  the  former  are  4,  10, 
and  those  of  the  latter  4,  10,  17. 

Projection  of  any  branch  from  a  general  point  clearly  gives  one  represented 
by  a  subring  of  the  ring  representing  the  given  branch;  if  the  projection 
alters  the  characters  (i.e.  the  multiplicity  sequence)  of  the  branch,  this  means 
that  the  canonical  closure  of  the  subring  is  not  that  of  the  given  ring,  but 
is  a  subring  of  the  latter;  i.e.  the  multiplicity  sums 


for  the  projected  branch  are  a  certain  selection,  not  the  whole,  of  the 
multiplicity  sums 


vlt     ^  +  v2, 
of  the  given  branch.  Thus  we  must  have  for  some  i^  iz,  ... 

*i=  S  *j,     "2=    S    "/,     "3=    S    fy     .. 
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It  seems  natural  to  regard  the  first  point  of  the  projected  branch  as  corre- 
sponding to  the  first  i±  points  of  the  given  branch,  the  next  point  of  the 
former  to  the  next  i2  points  of  the  latter,  and  so  on.  Thus  in  the  projection 
of  the  branch 

k  +  W  +  kt*  +  kt™  +  kt™  +  JB14  +  k[t]  *18 

into  a  plane  we  may  hold  that  the  seventh  and  eighth  points  of  the  canonical 
branch  (which  are  its  first  two  simple  points)  are  projected  into  the  same 
point  of  the  plane,  which  is  accordingly  a  fifth  double  point  of  the  plane 
branch.  This  is  seen  to  agree  with  the  genesis  of  a  cusp  by  projection  from 
a  point  on  the  tangent,  where  the  double  point  on  the  projected  branch 
certainly  arises  from  two  consecutive  simple  points  on  the  original,  the 
second  point  on  the  projected  branch  from  the  third  simple  point  on  the 
original,  and  so  on. 

Thus  where  a  branch,  such  as 


cannot  be  projected  into  a  lower  space  without  altering  its  characters,  this 
means  that  every  cone  (of  whatever  vertex)  which  passes  through  certain 
of  its  points  inevitably  passes  through  certain  others;  in  the  present  case 
every  cone  which  passes  through  the  seventh  point  passes  also  (and  with 
the  same  multiplicity)  through  the  eighth;  whereas  the  branch 


has  not  this  property. 
The  canonical  rings 

fli-KJ,     B*     H3,     ..., 

obtained  from  a  canonical  ring  H  represent  the  branches  obtained  by 
resolving  the  points  of  the  branch  in  succession,  as  is  seen  in  the  course  of 
the  proof  of  C.A.,  theorem  7.  Thus  the  base  characters  of  these  rings  show 
the  dimensions  of  the  least  spaces  into  which  these  resolved  branches  can 
be  projected  without  altering  their  characters.  For  instance,  of  the  branches 


*H  =  k  +  ^4"(1  +  1)  +  kt6v(l  +  t)  +  ktlv(l  +  t)  +  k[£]  tSv, 
*H'  = 


considered  in  C.A.  §7,  both  are  capable  of  being  projected  into  three 
dimensions  without  altering  their  characters,  and  each  has  a  4v-ple,  a  2v-ple, 
and  two  i/™ple  followed  by  simple  points.  The  branches 


tr(  1  +  1)  +  Jfcp]  tP, 
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obtained  by  resolving  the  first  points  of  each,  have  of  course  the  same 
multiplicity  sequence,  namely  a  2y-ple,  and  two  y-ple,  followed  by  simple 
points;  but  whereas  the  former  of  these  cannot  be  projected  into  a  plane 
without  altering  its  characters,  the  latter  can.  In  fact  the  general  plane 
projection  of  the  former  is  represented  by  a  ring  of  the  form  k\Ylt  F2]  where 


By  the  same  method  as  in  the  first  example  the  canonical  closure  of  this 
is  found  to  be 


the  characters  of  which  are  2v,  3*>,  5v+  1,  so  that  the  projected  branch  has  a 
2y-fold  followed  by  not  two  but  three  v-ple  and  a  succession  of  simple 
points,  i.e.  the  first  v  simple  points  of  the  branch  *H±  are  projected  into 
a  single  *>-ple  point;  the  same  result  is  obtained  by  expressing  the  ring  in 
terms  of  r  =  yj/2",  when  it  takes  the  form 


By  similar  methods  it  can  be  seen  that  the  projection  of  either  of  the 
original  branches  *H,  *H'  into  a  plane  has  one  4y-ple  and  two  2y-ple  followed 
by  simple  points,  i.e.  the  two  i>-ple  points  are  projected  into  a  single  2i>-ple 
point. 

In  the  same  way  it  can  be  seen  that  the  branches 

(ii)     Xi- 

(iii)     *i  = 

(iv)     X±  =  *702(1  -f  J7)13,     X2  =  *1620(1  -hi7)30,     X3  =  *2340(1  -H7)44, 

considered  in  C.A.  §  7,  all  of  which  have  the  characters 

702,      1620,     2340,     2383 
and  the  multiplicity  sequence 

702  (twice),     216  (three  times),     54  (five  times), 
18  (three  times),     7  (twice),     4,     3, 

followed  by  simple  points,  differ  in  the  minimum  space  into  which  their 
canonical  equivalents  can  be  projected  without  altering  their  characters, 
(ii),  (iii),  (iv),  indeed  all  exist  in  three  dimensions  and  cannot  be  projected 
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into  a  plane;  but  whereas  the  branch  obtained  by  resolving  the  first  five 
points  of  (iv)  can  be  projected  into  a  plane  without  altering  its  characters, 
those  obtained  from  (ii)  and  (iii)  in  the  same  way  cannot.  The  difference 
between  these  latter  is  not  quite  of  the  same  kind,  since  the  number  of  base 
characters  is  the  same  at  every  stage;  only  their  values  differ;  thus  whereas 
in  (iii)  the  ring  k[Xlt  X&  X%]  contains  an  element  of  order  2340,  in  (ii)  it 
does  not;  this  means  that  the  branch  (iii),  or  any  other  in  three  dimensions 
canonically  equivalent  to  it,  can  be  cut  by  a  surface  (in  the  given  form  the 
plane  X3  =  0)  so  that  the  intersection  number  is  2340,  i.e.  a  surface  can  be 
drawn  passing  simply  through  the  first  eleven  points  of  the  branch  (as  far 
as  the  first  18-ple  point),  while  in  the  case  of  (ii)  this  is  not  possible — every 
surface  passing  through  the  eleventh  point  either  passes  through  some 
further  points  after  it,  or  has  higher  multiplicity  at  some  of  the  earlier 
points. 

The  branch  (v)  is  a  plane  branch,  and  can  therefore  by  the  classical  theory 
be  expressed  in  the  form 

Xi  =  r7'2,     X2  -  r1620  +  ar1*74  +  fer1692  +  cr1699  +  . . .     (6  *  0,  c  4=  0). 

The  transformation  from  the  variable  t  to  r  =  Jf}/702  is  lengthy  but  straight- 
forward. 

It  is  hoped  that  enough  has  been  said  to  make  clear  the  geometrical 
significance  of  the  canonical  branch  of  which  a  given  branch  is  a  projection, 
and  of  the  number  and  values  of  its  base  characters. 


The  University 
Istanbul 
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By  L.  S.  BOSANQUET 

[Received  17  January  1946.  —  Read  17  January  1946.  — 
Received  in  revised  form  27  August  1946] 

1.  Necessary  and  sufficient  conditions  have  been  stated  by  Schur*  for 
a  series  2awenf  to  be  summable  (C,  p)  whenever  San  is  bounded  or  summable 
(C,  AT),  where  p  and  K  are  integers.  I  have  given  a  proof  of  Schur'  s  theorem  in 
a  recent  paper.  J  Here  I  extend  the  result  by  considering  also  series  which 
are  "more  than"  or  "less  than"  summable  (O,/c). 

We  suppose  throughout  that  p  and  K  are  integers.  We  write§ 


where  A  *  is  given  by  the  identity 

(i-*)--  I-HA;*"    (|*|<i)  (2) 

and                                       3n  =  a0+a1+...+an.  (3) 

Thus               ,S£  =  s0  +  s1+.  ..  +  «„,     «»=*„,     -S^a*.  (4) 

and                                                 ^  =  fl*(l).  (5) 
We  shall  use  the  identities 

(i)  ^t+i  =  SKflJ),     (ii)  Sr1  =  8?(8S),  (6) 

which  are  included  in               8%+*  =  ^(/Sf«).  (7) 
We  also  note  that 

«(«,)  -  a^«+1K)  =  8*(V8,),  (8) 


*  I.  Schur,  Journal  fiir  Math.  151  (1921),  79-111. 

00 

f  Here  and  elsewhere  S  denotes  2. 

o 

J  L.  S.  Bosanquet,  Journal  London  Math.  Soc.  20  (1945),  39-48.  This  will  be 
referred  to  as  I. 

§  We  shall  only  require  the  case  where  a  is  an  integer  (positive,  negative  or  zero). 
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and,  if  AC  is  a  positive  integer, 

<8£c(«,)  =     S    ^T^-St-l 

V=n-K  ^  =  0 

since  ^l^*"1  =  0  for  ju,  ^  ic+  1. 
Also,  if  /c  is  a  positive  integer, 


Finally,  we  write 

&,n  =  &Sn  =  Sn-sn+1,         A«  =  AA«-*     ((c>2),         AX  =  ««. 

so  that,  for  AC  =  0,  1,  ..., 

A**n  =  (-!)*£-£>„).  -(12) 

We  obtain  the  following  result,  where  the  case  p  =  0,  p  ^  0,  K  ^  0  is  Schur's 
theorem,  which  includes  the  Bohr-Hardy  theorem  (when  p  =  K)  and  a 
theorem  of  Bromwich  (when  p  =  0).* 

THEOREM  A.  //  —  1  </?  <  AC  (p,  K  integers)  and  p  is  real,  then  necessary  and 
sufficient  conditions  for  %anen  to  be  summable  (C\p)  whenever  S%  =  O(nK+p)  are 

(i)  en  =  o(n?~K-v),     (ii)  S(»  +  1)*+*  |  &*+1en  \  <  oo.  (13) 

//  p  >  K  ^  0,  the  conditions  are  the  same  as  when  p  —  K,  but,  if  p  >  K  =  —  1, 
the  first  condition  must  be  omitted. 

The  same  holds  with  O  and  o  interchanged. 

2.  We  require  a  number  of  lemmas.  In  all  proofs  we  consider  the  first 
of  the  alternative  statements.  Only  slight  modifications  are  required  in  order 
to  prove  the  companion  results. 

LEMMA  1.  //  K  ^  0  and  p  is  real,  and  if  £J  =  O(n*+*),  then  S%  -  O(nK+*) 
for  a  =  -1,0,  ...,ic-l.. 

The  same  holds  with  o  in  place  of  O. 

This  is  proved  by  repetition  of  (6)  ii,  since 

8-  ==  S-1^1)  =  «+l-  «±i     (n>  1). 


*  For  references  see  I. 

In  my  first  draft  of  theorem  A,  I  assumed  that  p  >  —  K—  1,  but  Prof.  Hardy  kindly 
pointed  out  that  this  condition  was  not  used  in  the  proof.  There  is,  however,  a  dis- 
tinction between  the  cases  p>  —  K—  1  and  p^—  K—  1.  For,  if  p>  —K—\,  it  is 
reasonable  to  write  sn  =  O(nv)  (C,  /c),  since  by  lemma  2  this  implies  8n  =  0(n*») 
((7,  /c-fl).  But,  if  p^  —  K—  1,  it  is  not  reasonable  to  do  so,  as  is  seen  by  taking 
S%  =  (n  +  1  )*+»,  when  ^+1  =  A  +  0(n*+1+*>)  if  p  <  -  AC  -  1  ,  and  #£"  ~  log  n  if  p  =  -  K  -  1  . 
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LEMMA  2.  Ifp  >  -*-  1*  and  S£  =  0(»*+*),  then  S£+1  =  0(nK+*+*>). 
The  same  holds  with  o  in  place  of  O. 
By  (6)  i  we  have 

8?*  -  8k(Sf)  =  S  0{(v+  1  )*+*»}  =  0(nK+*+»). 

i»-=0 

LEMMA  3*t  If  K  $*  2  and  #  is  reaJ,  am?  i/ 

(i)  ew  -  o(»-«),     (ii)  AX  =  o(w-*-«),  (14) 


o(7T-or-<0/0r  or  =  1,  2,  ...,  AT-  1. 

i£A  O  m  place  of  o. 

Suppose  that  g=t=0,  —  1,  ...,  —  (K—  1).  Then  since,  by  (14)ii  and  (12), 
S~K(ev)  =  ofa-"-*)  it  follows,  by  repeated  applications  of  (6)i  and  (5),  that 
there  are  constants  a^  (/*  =  1,  2,  .  .  .  ,  ic)  such  that 

S~-(ev)  =  SST'iS?*^)}  -X^AtT'-x  +  ofr-"-*)  (15) 

/«-i 

forcr  =  K-l,K-2,  ...,0.  On  taking  o*  =  Oin(15)  we  find  from  (14)  i,  by  (10), 
that  the  numbers  a^  are  either  zero,  if  ju,  ^  K  +  q,  or  arbitrary,  if  p  >  K  +  q  (and 
so  may  be  taken  as  zero),  and  hence  ^^fe^,)  ==  o(n~ff~q)  for  o"=  1,2,  ...,  AC—  1, 
i.e.  by  (12),  A^  =  o(»-^-«). 

Suppose  that  q  =  —  A  (A  =  0,  1,  ...,  AC—  1).  Then  it  follows  from  (14)  ii 
that  (15)  holds  for  <r  =  K-  1,  /c-2,  ...,  A+  1,  while  for  cr  =  A,  A-l,  ...,0, 

Snff(ev)  =  Ya^^r*"4  +  »(»-'-«  logn).  (16a) 

/*=i 

Taking  (7  =  Oin(16a)wefindfrom(14)ithata/4  =  Ofor/*=  1,  2,  ...,/c-A-  1, 
and  hence,  by  (15),  that  S»<r(el,)  =  o(n-*-«)  for  o*  =  AC-  1,  AC-  2,  ...,  A+  1, 
and  in  particular  that  w/S£A~1(ej;)  =  o(l). 
Now,J  by  (7),  (8)  and  lemma  2, 


(n  +  a)  Sf^M  ~  <*>%-  Afo)  =  ^{^r^1^)}  =  o(»«)  (16) 

for  a  =  0,1,...,  A,  and  it  follows  by  induction  from  (14)i  and  (16)  that 
Sn'fa)  =  o(n-*-«)  for  cr  =  1  ,  2,  .  .  .  ,  A. 


*  There  is  no  need  to  restrict  the  value  of  K . 

t  The  case  K  +  q  =  1  is  the  Cesaro-Tauber  theorem.  The  case  q  =  0  was  proved 
in  I  (lemma  8).  When  K  +  q<  1  condition  (i)  is  superfluous. 
J  The  final  step  is  simply  the  Cesaro-Tauber  theorem. 
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LEMMA  4.*  If  q  is  real  and  S(n+  1)«  |  Aew  |  <  oo,  then  there  is  a  number  s 
such  that  en  —  8  +  o(n~q). 

Suppose  that  #  >  0.  Then  S  |  Ae^  |  <  oo,  and  hence  there  is  a  number  s  such 
that  en->s,  and 


Suppose  that  q<0.  Then 

I  en~em  |  <  S  |  Ae,  |  <»-«  S  i*  |  Ae, 

v=m  v*=m 

for  some  m  and  all  n>m.  Hence  en  =  o(n~~q). 
LEMMA  5.f  //  /c  ^  1  ara£  g  is  reaZ,  and  if 

(i)  en  =  o(n-<*),     (ii)  S(»  +!)*+«  |  A*+%n  |  <oo,  (17) 

t^en  A^en  =  0(71-*-^). 

The  same  holds  with  0  in  place  of  o. 

It  follows  from  (17)ii,  by  (1  1),  (12)  and  lemma  4,  that  there  is  a  number  6' 
such  that  -" 


"-^.  (18) 

Hence,  if  #4=0,   —  1,  ...,   —  (K—  1),  by  (5),  (10)  and  repeated  applications 
of  <6)  {'  '-' 


and  it  follows  from  (17)i,  by  (10),  that  s  is  zero,  if  q^  —AC,  or  arbitrary,  if 
q  <  —  K  (and  so  may  be  taken  as  zero). 
Also,  ifq  =  0,  -1,  ...  or-(K-l), 


so  that,  by  (17)  i  and  (10),  s  is  zero. 

Thus  in  all  cases,  by  (18)  and  (12),  A*ew  =  o(n-K-<*). 

LEMMA  6.  IfQ^p^K(p,K  integers),  p  is  real,  q<p+l,  and  if  8%  =  O(nK+*>) 
and  (i)  en  =  o(nP-«-«),     (ii)  AX  =  o(n~*-«),  (19) 

then  Sfi(svev)  =  0(1*+*-*),  i.e.  snen  =  o(n?-«)  (C,p). 

The  conclusion  holds  if  O  and  o  are  interchanged  in  the  hypotheses. 
Suppose  that  p  =  0.  Then,  by  lemma  1  and  (19)i, 


*  The  case  g^O  was  proved  in  I  (lemma  6),  where  the  reference  to  Bromwich. 
applied  only  to  the  second  part. 

f  This  is  included  in  a  result  proved  for  q  =  0  by  A.  F.  Andersen,  Studier  over 
Ces&rcfs  summabilitets  metode  (Copenhagen,  1921),  and  for  q^O  by  L.  S.  Bosanquet, 
Journal  London  Math.  Soc.  17  (1942),  166-173.  If  q^  —K  it  is  sufficient  to  have  O  in 
(17)  i  to  obtain  o  in  the  conclusion. 
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Suppose  that  1  ^p  <  /c,  and  assume  the  result  with  K  replaced  by  K—  1  and 
c-1.  Write 

S^  =  #«iH.i+S#AV  (20) 

0  0 


Then          SfT^Si)  =  SKn  -  ,      n+1 

and,  by  lemma  3,  A*-^^  =  o{w"<*~1Hjf}.  Thus  /Si,  en+1  satisfy  the  hypotheses 
of  sn,  en  with  AC,  p,  p  replaced  by/c-l,p-l,p+l.  It  follows  from  the  case 
assumed  that 


(C>p_  1}. 

Again,  by  (1  1  )  and  lemma  3,  Aew  =  o[nr~<K-^-^+iy}t  where  r  =  min  (p,K-l  ), 
and  A^-1Aen  =  &Ken  =  O{TZ,-<'C-I>-<«+I>}.  Thus  ^i,  Aen  satisfy  the  hypotheses 
of  sn,  en  with  /c,  p,  p,  q  replaced  by  ic  —  1,  r,  p  +  1,  0+1. 

It  follows  from  the  case  assumed  that  S\  Aetl  =  o{n(p+1)-(Q+1)}  ((7,  r),  and  so 
=  o(np~Q)  (  (7,  p),  by  lemma  2,  since  ^>  —  ^>  —  T—  1.*  Hence,  by  (6)  i  and  (20), 


i.e.  oSfoe,,)  =  tf£ 

and  the  lemma  is  proved  by  induction. 

3.    Proof  of  theorem  A  (sufficiency).  Suppose  that  p  =  0,  K  =  0.  Write 

n  n 

o  l  o 

Then  by  (13)  i,  since  £°  =  O(nP), 


and,  by  (13)  ii,  S  |  SJAe,  |  <^S(v+  1)»  |  Ae,  |  <oo. 

Thus,  by  (21),  Sanen  converges. 

Suppose  that  p  =  0,  /c >  1  ara£  assume  the  result  with  K  replaced  by  K—l 
and  p  =  0.  Then,  by  lemma  1  and  (13)  i, 


*   Since  the  proof  is  by  induction  we  require  here  the  full  force  of  the  condition 
p  —  q>  —  l.  When  p  =  0  it  is  not  used. 

When  p  ^  1  the  result  is  false  with  q=p+l,  as  is  seen  by  taking 


(»+  I)}-1, 
so  that,  by  (9), 


and  «nen-2^(n+  I)'1  (log  (n+  I)}'1. 
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Also        S* 

by  (11)  and  (13)i,  and  S(»+  I^-J>HP+I)|  A*(Aen)  |  <oo,  by  (11)  and  (13)ii. 
Thus  $£,  Aeu  satisfy  the  hypotheses  of  an,  en  with  K,  p  replaced  by  K—  1, 
#4-1,  and  so  by  our  assumption  S$jAew  converges.  Thus,  by  (21),  %anen 
converges,  and  the  case  p  —  0  is  proved  by  induction. 

Suppose  that  l^p^K  and  assume  the  result  with  K  replaced  by  K  —  1  and 
O^p^K-  1.  Then,  by  (13)  and  lemma  5,  A*en+1  =  o(n~K-v).  Thus  #°,  en+l 
satisfy  the  hypotheses  of  sn,  en  in  lemma  6  with  q  =  p,  and  hence 


Also,  by  (11)  and  lemmas  5  and  3, 

Aeu  =  ofr- 
where  r  —  min(/o,/c—  1),  and  by  (6)i, 


Thus  $°  ,  Aew  satisfy  the  hypotheses  of  an,  en  in  the  present  theorem,  with 
K,  p,  p  replaced  by  K—  1,  p+1,  r.  It  follows  from  the  case  assumed  that 
S$°Aen  is  summable  (C,r),  and  hence  (C,p).   Thus,  by  (21),  "Zanen  is 
summable  (£7,/o),  and  the  case  1  </></c  is  proved  by  induction. 
Suppose  that  p  ~  —  1,  ic  >  —  1.  Then,  by  lemma  1  and  (13)  i, 


so  that  Ttanen  is  convergent,  and  if  &  ^  0,  />  =  —  1,  the  convergence  of  *£anen 
follows  from  the  case  p  —  0.  Thus  £anen  is  summable  (C,  —  1). 

Suppose  that  p>  K^  —  1.  Then  (13)i  and  (13)ii,  with  p  —  K,  ensure  that 
Sanew  is  summable  (C,  /c),  and  so  (C,p).  But  in  the  case  /c  =  —  1  the  second 
hypothesis  alone  is  sufficient  to  ensure  the  convergence  of  %anen. 

This  completes  the  proof  of  the  sufficiency  part  of  the  theorem. 

4.  In  order  to  prove  the  necessity  of  the  conditions  in  theorem  A  we 
require  some  more  lemmas. 

LEMMA  7*.  If  K^  —  I  and  pis  real,  and  if  anen  =  o(l)  whenever  8%  =  O(nK+p), 
then  en  =  0(71-*-*). 

The  same  holds  with  0  and  o  interchanged  throughout. 

Suppose  the  result  false,  so  that  lim  nK+p  \en\  >d>0,  for  some  d.  Then 
there  are  integers  nv  such  that  nQ  ^=0,  nv  —  nv^  ^  K  +  2  and  n$+p  \  env  \  >  d  for 
^=1,2,.... 

*  Proved  for  p  =  0  in  I  (lemmas  9  and  10). 
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Choose  an  so  that  8iv  =  U$+P(V>  1),  Si  =  0  (n^nj.  Then  Si 
and  by  (9),  since  Siv_^ 


Hence    |  anveny  \  =  nJ+»  |  en|F  |  >  £  for    v  =  1,  2,  ...,    which    contradicts   the 
hypothesis  that  anen  —  o(l). 

LEMMA  8.  //  2Zgn(x)  sn  converges  for  0  <  x  <  I  and  its  sum  tends  to  a  limit  as 
—  Q  whenever  {sn}  is  convergent,  then  there  are  numbers  M,  X  such  that 


This  is  due  to  Schur.* 

LEMMA  9.  //  K^  —  1,  p  is  real,  and  Sanen  is  summable  (A)  whenever 
S^  =  o(n*+*),  then  S(n+  1)*+*  |  A*+%w  \  <oo, 

The  series  Zanenxn  converges  for  0<x<  1  whenever  Si  =  o(nK+p),  and 
in  particular  if  Si  =  (  -  1  )n  (n  +  1  )«+P-I,  when  an  =  ^  '^(^J)  ~  2«+1n<c-^-1.t 
Since  the  same  is  true  with  x'  in  place  of  x,  where  x  <  x'  <  1  ,  it  follows 
immediately  that  nqenxn  =  0(1)  as  ^->oo,  for  0<x<l  and  every  real  q. 
Hence  if  Si=  o(nK+*)  we  have,  by  Ac-fl  partial  summations,  since 
Si  =  o(nK+v)  for  v  =  0,  1,  ...,/c,  by  lemma  1, 

SanettrB»  =  SS£A*+i(en*»)  (22) 

for  0<a?<  1.  Further,  since  S«nen  is  summable  (A),  the  right-hand  side  of 
(22)  tends  to  a  limit  whenever  Si  =  o(nK+p).  Hence,  by  lemma  8, 

S(w  +!)«+»  |  A*+1(ena;n)  |  ^  M 
for  X  <  x  <  1,  J  and  so,  on  letting  x  ->  1  —  0,  we  obtain 

S(w  +!)*+*  |  A^^  |<oo. 
LEMMA  10.  //  AC  ^  0  arw^  p  w  rea£,  ane?  */  Sawen  ts  summable  (A)  whenever 


TAe  .same  feoZrf^  w*<fe  O  and  o  interchanged. 

Suppose  ihatp^r  —  1,  —  2,  ...,  —  /c.  Then,  by  repeated  applications  of  (6)i 
and  (5),  whenever  sAi  =  O(nv)  there  are  constants  a0,  alt  ...,  a/f_1  such  that 


*  Schur,  loc.  cit.  f  Of.  footnote  to  lemma  6.  J  We  may  take  X  =  0. 
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i.e.  by  (5)  and  (7),         SKn  L-  '^^Af-*}  =  0(n*+»). 
I       //=o  ) 

Hence,  if  we  alter  sn  suitably  for  n  =  0,  1,  ...,/e  —  1,*  i.e.  alter  an  for 
n  —  0,  1,  ...,/c,  we  obtain  $£  =  O(nK+p).  It  follows  that  Sanen  is  summable 
(A)  whenever  sn  =  O(np),  and  so,  by  lemma  9,  with  K  =  0, 

£(ra+l)p|  AeJ<oo. 
It  follows  from  lemma  4  that  there  is  a  number  s  such  that 

*>),  (23) 


and,  if  p  <  0,  5  is  arbitrary  (and  so  may  be  taken  as  zero). 

If  p  ^  0  we  observe  that,  by  (23)  and  lemma  9,  en  —  s  satisfies  the  conditions 
of  en  in  theorem  A,  with  p  =  K.  Hence,  by  the  sufficiency  part  of  theorem  A, 
Zan(eu  —  s)  is  summable  (C,  AC),  and  so  (A),  whenever  $*  =  O(nK±p).  It  then 
follows  from  the  summability  (A)  of  *&ancn  that  ^san  is  summable  (A) 
whenever  $*  =  O(nK+f>). 

Now,  if  p  >  0,  the  example  an  =  (n  +  l)v~l  shows  that  s  must  be  zero,  and 
so,  by  (23),  en  -  o(n~P). 

If  p  =  0  we  may  take  an  =  (n+  I)-1  cos  {log  (n+  1)}  =  3t{(n+  l)1'-1}^ 
Then  Sari  is  bounded  (C,  0),  but  not  convergent.  Hence  $£  =  0(w*  ),  but 
SaM  is  not  summable  (A)  since  an  —  O(n~^).%  Hence  in  this  case  also  s  =  0, 
and  so,  by  (23),  en  =  o(n~v). 

Suppose  that  p  =  —&  (A  =  1,  2,  ...,  /c).  Then  Sanen  is  summable  (A) 
whenever  $£  =  O(7i'f-A),  and  so,  since  ^  =  /S^~A(/Sj),  whenever  8%  =  0(1). 
It  follows  from  lemma  9,  with/c  =  —  p  =  A,  that 


<oo.  (24) 

Hence,  by  (11),  (12)  and  lemma  4,  with  q  =  0,  there  is  a  number  s  such  that 


and  so,  by  repeated  applications  of  (6)  i  and  (5), 

en  =  Si{,Sf;r*  (e,)}  =  ^A  +o(n*).  (25) 


Since,  by  (12),  (5)  and  (7),  AA+M£  =  (  -  l)A+1^n+A+i  =  0  for  ^  ^  0,  it  follows 
from  (24)  and  (25)  that  en  —  sA*  satisfies  the  conditions  of  en  in  theorem  A, 


*  If  or  is  a  positive  integer,  A~a  ~  0  for  n^  or. 

t  See  G.  H.  Hardy,  Proc.  London  Math.  Soc.  (2),  8  (1910),  301-320. 
J  If  2an  is  summable  (A)  and  an  =  O(n~l),  then  San  is  convergent.   J.  E.  Little- 
wood,  Proc.  London  Math.  Soc.  (2),  9  (1910),  434-448. 
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with  p  =  K  =  -p  —  A,  and  hence,  by  the  sufficiency  part  of  theorem  A, 
^an(en-8^-n)  is  summable  (C,  A),  and  so  (A),  whenever  #£  =  0(1).  It  then 
foUows  from  the  summability  (A)  of  2anen  that  ^sA^an  is  summable  (A) 
whenever  £*  =  0(1). 
But  if 

a    = 


and  an  is  suitably  chosen  for  n  =  0,  1,  ...,  A-  1,  we  have  S*  =  0(1),  while 
2L4£an  is  not  summable  (A),  since,  by  (10), 


-1  cos  {log  (w+ 
Thus  s  =  0,  and  so,  by  (25),  en  =  o(n*)  =  o(n~»). 

5.  Proof  of  theorem  A  (necessity).  If  -1  ^p^K,  the  necessity  of  (13)i 
follows  from  lemma  7,  with  n~Pen  in  place  of  en,  since  awe,R  =  o(n?)  whenever 
S#nen  is  summable  (C,p),  by  lemma  1. 

If/c^  —  1,  /O  -1,  the  necessity  of  (13)ii  follows  from  lemma  9,  since 
Sareew  is  summable  (A)  whenever  it  is  summable  (C,p). 

IfQ^K<p,  the  necessity  of  (13)  i,  with  p  =  /c,  follows  from  lemma  10. 

To  show  that  the  condition  en  =  o(n~P)  need  not  hold  in  the  case  K  =  -  1, 
/)  ^  0,  we  observe  first  that  the  condition  £(?&+  I)*3"1  1  ew  |  <  oo  has  already 
been  proved  to  be  necessary  and  sufficient.  Now  let  €„» 
en  =  0  (n  +  v2).  Then 


i  i 

but  en4:o(7z--^). 

This  completes  the  proof  of  theorem  A. 

6.   We  end  by  giving  a  simple  application  of  theorem  A  (in  its  alternative 
form). 


THEOREM  B.*    If  —l^p<K  and  p  is  real,  and  if  Sn=*o(nK+p),  then 

£(r&+  l)f>-K-*>an  is  summable  (C,p). 

If  p  —  /c,  the  result  is  false  except  when  p  =  0,  —  1,  ...,  —  /c.f 

Proof.  We  have  en  =  (w  +  1  )*-*-i»  =  O^"*-*),.  which  is  the  alternative 

form  of  (13)  i. 


*  The  case  Q^p<K,  p  =  Q,  was  given  by  M.  Riesz,  Comptes  rendus,  148  (1909), 
1658-1660,  the  case  p  —  /c,  p  >  0,  by  G.  H.  Hardy  and  J.  E.  Littlewood,  Proc.  London 
Math.Soc.  (2),  11  (1913),  411-478,  and  the  cases  p  = /c,  -l<p<0andp  =  /c,p  =  -l, 
by  J.  M.  Hyslop,  Proc.  Edinburgh  Math.  Soc.  (2),  5  (1938),  182-201.  The  last  authors 
proved  further,  in  the  cases  p  =  /c,  p>0  and  p  =  /c,  —  l<p<0,  respectively,  that 
£(n-f  l)-p  an  is  either  summable  (C,  K)  or  not  summable  (A). 

t  p  may  be  replaced  by  p  +  iq,  where  g  is  real,  except  that  the  result  remains 
false  when  p  =  /c,  p  =  0,  —  1, . . .  —  /c,  g  56  0. 
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If  —  1  ^  p  <  K,  we  have 

S(%  +  !)«+*  |  b*+i{(n  +  iy-*-*}  |  <  4^(n  +  !)P-«-I  <  oo, 
while,  if  /o  =  ic,  _p  =  0,  —  1,  ...,  —  /c, 

)-*}  |  =  0, 


so  that  (13)ii  is  satisfied  in  both  cases.   But  if  p  =  K,  ^4=0,  —  1,  ...,  —  /c, 
there  is  a  number  A  >  0  such  that 


'1  =  oo, 
so  that  (13)ii  is  not  satisfied.  The  result  follows  from  theorem  A. 
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1.  In  1920,  H.  Steinhaus  (1)  proved  that  the  set  of  distances  between  the 
points  of  a  measurable  set  of  positive  measure  along  the  line  filled  up  an 
interval.  Moreover,  it  was  actually  shown  that  this  set  of  distances  included 
an  interval  with  the  origin  as  an  end-point.  An  application  of  Fubini's 
theorem  at  once  establishes  the  same  results  in  the  plane  when  ordinary 
Lebesgue  measure  is  used.  S.  Piccard  (2, 3)  has  also  shown  extreme  interest 
in  this  subject,  having  in  1939  published  her  book  Sur  les  ensembles  de 
distances  des  ensembles  de  points  d'un  espace  Euclidien,  and  in  1942  her  book 
Sur  les  ensembles  parfaits.  In  most  of  her  work  she  deals  with  sets  along  the 
line. 

In  this  paper  we  shall  turn  to  Euclidean  space  of  two  dimensions,  where 
the  measure  concerned  is  Caratheodory  linear  measure.  The  work  of 
Steinhaus  suggests  the  following  question:  Given  a  measurable  set  in  the 
plane  of  positive  measure,  is  it  necessarily  true  that  the  set  of  distances 
between  the  points  of  the  set  will  include  an  interval  about  the  origin? 
It  may  be  recalled  that  Besicovitch  in  1937  (see  (6),  324)  proved  that  any 
regular  set  consists  of  a  set  of  measure  zero  and  a  F-set  (a  F-set  is  a  set 
which  is  contained  in  a  denumerable  set  of  rectifiable  curves).  In  view  of 
the  somewhat  similar  behaviour  of  regular  sets  and  rectifiable  curves,  one 
might  expect  that  the  set  of  distances  between  the  points  of  a  regular  set 
would  fill  up  an  interval  about  the  origin.  We  have  been  able  to  prove  this 
to  be  the  case. 

In  general,  the  answer  to  our  question  turns  out  to  be  in  the  negative.  Two 
examples  of  irregular  sets  will  be  given,  such  that  the  set  of  distances  fills 
up  an  interval  about  the  origin  for  one  of  them,  and  does  not  for  the  other 
one.  However,  in  these  two  examples  both  sets  have  inferior  density  zero 
at  every  point.  Could  it  then  be  possible  that  the  added  restriction  of 
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positive  inferior  density  almost  everywhere  in  a  set  would  assure  that  the 
points  of  the  set  would  include  an  interval  about  the  origin?  An  example 
is  furnished  to  show  that  this  is  not  the  case.  Finally,  an  irregular  set  is 
introduced  to  show  that  the  set  of  distances  may  fill  up  an  interval  without 
including  any  interval  about  the  origin. 

2.   We  now  make  the  following  definitions.  Let  A  be  a  plane  point  set. 
For  each  sequence  of  bounded  open  sets  Ult  U2,  ...  lying  in  the  plane  and 


satisfying  the  conditions:  (a)  ^U^A  andf  (b)  d(U€)  ^p,  i  =  1,  2,  ...,  where 
p  is  a  pre-assigned  positive  number,  form  ^d(Ui).  Let  Lp[A]  denote  the 

oo  <«-! 

greatest  lower  bound  of  all  such  sums  2  d(U^).  Ifp  decreases,  LSA\  cannot 


decrease.  Define  lira  LJA]  =  L*\A\.  L*  [A]  is  called  the  outer  Carath^odory 

p->0 

linear  measure  of  A  .  Such  a  measure  function  will  satisfy  Carath6odory's 
six  fundamental  measure  postulates,  from  which  it  follows  that  closed  sets, 
open  sets,  etc.,  are  all  measurable.  J 

Define  c(p,  r)  to  be  the  set  of  points  lying  in  the  closed  circle  with  centre  p 
and  radius  r.  Let 


represent  the  superior  and  inferior  density  functions  respectively  of  the 
set  A.  If  D(A,p)  —  D(A,p),  then  the  density  D(A,p)  of  A  at  the  point  p 
is  said  to  exist  and  has  this  common  value.  If  the  density  at  almost  all 
points  of  A  exists  and  is  unity,  then  A  is  said  to  be  regular.  If  the  density 
at  almost  all  points  of  A  either  does  not  exist  or  exists  but  is  not  unity, 
then  A  is  defined  to  be  irregular. 

3.  In  this  section  we  shall  establish  that  the,  set  of  distances  between  the 
points  of  a  regular  set  contains  an  interval  about  the  origin.  It  has  been  men- 
tioned before  that  a  regular  set  consists  of  a  set  of  measure  zero  and  at  most 
a  denumerable  set  of  subsets  of  rectifiable  curves.  Thus,  we  need  only  prove 
our  result  for  the  case  of  a  linearly  measurable  set  E,  L[E]  >  0,  which  is  a 
subset  of  a  single  rectifiable  curve  C. 

f  The  symbol  d(  U)  will  be  used  to  denote  the  diameter  of  the  bounded  plane  set  U. 
Define  d(a,  6)  to  be  the  distance  between  the  points  a  and  b  in  the  plane.  For  two 
sets  A  and  B  in  the  plane  A  .  B,  A  +  B,  A  —  B  will  be  the  ordinary  point-set  inter- 
section, sum  and  difference. 

J  Only  Carathe"odory  linearly  measurable  plane  sets  will  be  used.  We  shall  adopt 
the  customary  notation  L[A]  to  represent  the  measure  of  a  measurable  set  A.  It 
may  be  shown  that  the  set  of  points  on  a  rectifiable  curve  is  measurable  and  has 
measure  equal  to  the  ordinary  length  of  the  curve. 
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Let  q  and  r  be  any  two  points  on  C.  Then  define  qr  to  be  the  set  of  points 
on  the  chord  joining  q  and  r  and  qr  to  be  the  set  of  points  on  the  arc  of  C 
joining  q  and  r. 

Let  a,  b  be  the  end  -points  of  C  and  choose  a  positive  number  a,  <  \n. 
Denote  by  /  the  set  of  all  points  of  the  subarcs  of  C  subtending  chords, 
whose  angles  with  ab  are  at  least  as  large  as  a.  (We  shall  refer  to  /  as  a  set 
of  arcs.)  Obviously  /  is  a  finite  or  denumerable  set  of  non-overlapping 
subarcs  of  C.  Furthermore,  from  the  very  definition  of  /,  the  chord  joining 
any  point  of  C  —  /  to  any  other  point  of  C  forms  an  angle  less  than  a  with  ab. 

Fix  7}>Q.  By  the  definition  of  /  we  can  choose  a  finite  set  S^S^,  ..  .,  Sk 
of  non  -overlapping  arcs  of  C  subtending  chords  whose  angles  with  ab  are 
as  large  as  a,  and  satisfying  the  inequality 

S£tSJ  >*£[/]-*  (l) 

i=l 

Let  $(,  $£,  ...,  S'k'  denote  the  complementary  arcs  of  the  set  819  S2,  .  .  -  ,  Sk 
with  respect  to  C.  The  sum  of  the  absolute  values  of  the  orthogonal  pro- 
jections of  S-L,  /S2,  .  .  .  ,  Sk,  S[,  $a»  •  •  •  »  &kr  on  ab  ig  a^  least  equal  to  L[ab],  and  so 

S  L[Si]  cos  a  +  S  L[Si]  ^  L\ab]. 

i=l  i=l 

From  this  it  is  easy  to  see  that 

I 

i«i 
Consequently,  using  (1),  we  have 


This  being  true  for  any  ij  >  0,  we  have 


- 
1  —  cos  a 

Inscribe  a  polygonal  line  p0pt  .-pn  into  the  arc  ab  so  that  p0  and  pn 
coincide  respectively  with  a  and  b.  On  any  arc  p^iPi  consider  all  the  arcs 
subtending  chords  whose  angles  with  p^^Pi  are  as  large  as  a.  Denote  by 
/^  the  set  of  points  of  such  arcs  corresponding  to  all  pi-iPi-  By  (2) 


(3) 
for  »=  1,2,  ...,n,  and  thus  ^      w_i      

:2 — - 


1  —  cos  a 

U2 
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I&)  can  be  considered  as  a  finite  or  denumerable  set  of  non-overlapping 
subarcs  ofp^pt,  i  =  l,i,  ...,n. 

We  now  choose  <%  so  that         cos  2a  >  ff  ,  (±) 

and  a  polygonal  line  pQp±  ...pnso  that 

(5) 


Denote  by  C±  the  curve  obtained  from  C  by  replacing  the  arcs  of  I(p)  by  their 
chords  (note  that  the  points  pi  still  remain  on  C^).  From  now  on  qr  will 
denote  an  arc  of  C±  and  not  of  G.  Let  E1  be  the  set  EC±.  Then  by  (5) 


A  =  L[E]  -  L[IW  E]  ^  L[E]  -  £[/<*»]  >  0.  (6) 

Let  q  be  an  interior  point  of  the  arc  p^^pi  and  0  <  t  ^  L[qp^\.  Denote  by 
q'  the  point  of  qpt  such  that  L[qq']  =  t.  We  shall  call  qr  the  ^-transform  of  q. 
From  the  fact  that  no  chord  of  p^Pi  forms  an  angle  >  a  with  p^pi,  it 
may  be  shown  that  to  every  point  q  of  Pi-^pi  such  that  L[qpt]  ^  t,  there 
corresponds  one  and  only  one  ^-transform  q'  '. 

Denote  by  qi  the  point  of  Pi-iPt  such  that  L^p^  =  t.  If  G  is  any  set  of 

points  of  ^_!^,  there  exists  one  and  only  one  £-  transform  of  every  point 
of  G.  The  set  of  ^-transforms  of  all  points  of  G  is  called  the  ^-transform  of  G 
and  is  denoted  by  G'.  If  G  is  an  arc  cd,  then  G'  is  the  arc  c'd',  c'  and  df  being 
the  Z-transforms  of  c  and  d  respectively.  As  the  angle  between  any  chord 
i  and  pi-iPi  is  ^  a,  the  angle  between  any  pair  of  chords  c^,  c2d2 
=  2a-  From  this  it  follows  that,  if  c  is  an  interior  point  of  p^qi 

and  d  a  variable  point  of  2^_i<fo,  and  if  c',  d'  are  the  corresponding  ^-trans- 
forms, then 

..     L[c*df]     .—  L[cW]         1 
cos2a<lim~L^=r-J  <  hm  ~-^rr  J  ^  —  —  —  . 

~d=Tc-  £[«*]      *-*«  -^M]      C0s2a 

Hence,  if  cd  is  any  arc  on  p^-^qt  and  if  c',  d'  are  the  corresponding  ^-trans- 
forms of  c  and  c?,  then 

(7) 


3i    cos  ^a 

If  c  €  E±  and  k  e  C^  then,  for  almost  all  c, 

-  =  I-  (8) 
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Let  C€.E±  be  an  interior  point  of  some  Pi-rfi  satisfying  (8).  Then  there 
exists  a  point  &0  on  cpi  such  that  for  any  point  k  €  ckQt 

^B^]>?  (9) 

L[ck]     4' 

We  shall  show  that  any  t  <  %L[ckG]  —  A,  and  hence  the  open  interval 
(0,  A)  belongs  to  the  set  of  distances  between  the  points  of  E. 

Take  the  point  d  on  cpi  such  that  L[cd]  =  t.  Let  d'  be  the  ^-transform  of  d. 
As  any  chord  of  cd(ddf)  forms  an  angle  <  2a  with  the  chord  cd(ddr),  we  have 


%t.  (11) 

Let  J,  J±  C  J  be  two  sets  of  non-overlapping  arcs  of  cd,  such  that 

E^dCJ,     J-J^CE^cd,     L[J^<^t.  (12) 

Also  let  J',  J(  be  the  ^-transforms  of  J,  J±  so  that  J'  —  J{  is  the  ^-transform 
ofJ-Ji.  By  (7) 

L\J'  -  Ji]  =  L[J']  -  L[J[]  >  cos  2*  L[J]  -  ^-2-  L[J,].  (13) 

By  (9)  and  (12)                  L^cd]  >  f^,     L[J]  >  %t.  (14) 

Hence  L[J'  -  J{]  >  ^t-  ^t.  (15) 
By  (11)  and  (15) 

L[Elri']  +  L[J'-J'1]>(%  +  £:-1&)t>¥-t.  (16) 

Each  of  the  two  sets  E^cd'  and  J'  —  J[  are  on  cd',  and  by  (10)  and  (16)  the 
sum  of  their  measures  is  greater  than  the  measure  of  cd'  .  Consequently 
some  of  the  points  J'  —  J{  belong  to  E^cd'  .  Let/'  be  one  of  such  points.  It  is 
the  ^-transform  of  a  certain  point  /  of  J  —  J±.  As  the  whole  of  J  —  J±  belongs 
to  E^cd,  /  and/'  belong  to  E±cd'  and  they  are  a  pair  of  points  at  a  distance 
t  apart.  This  completes  the  proof. 

4.  We  now  consider  the  set  of  points  lying  interior  to  and  on  the  boundary 
of  the  square  p  =  ^  (0£X£l,  Ogy£l). 

(x,V) 

An  operation  of  order  n  on  the  set  F  is  defined  as  follows:  Divide  the  square 
F  into  n*  equal  closed  squares.  Of  these  the  n2  mutually  exclusive  closed 
squares  having  lower  left-hand  vertices 


will  be  the  result  of  the  operation  of  order  n  on  the  set  F. 
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On  the  set  F  perform  the  operation  of  order  3  obtaining  32  mutually 
exclusive,  closed  squares,  each  a  subset  of  the  set  F  and  having  side-length 
1/32.  It  will  be  said  of  the  32  closed  squares  that  they  belong  to  class  2.  Each 
of  the  closed  squares  of  class  2  will  be  denoted  by  /?2  and  the  union  of  the  32 
squares  /?2  by  B2.  On  each  of  the  squares  /?2  perform  the  operation  of  order 
4  obtaining  the  32.42  mutually  exclusive,  closed  squares  of  class  3  each 
having  side-length  l/(32.42).  Each  of  the  closed  squares  of  class  3  will  be 
denoted  by  /#8  and  the  union  of  the  32  .  42  squares  /?3  by  B3.  In  general,  on 
each  of  the  32  .  42  .....  (n  -f  1  )2  squares  ftn  of  class  n  perform  the  operation  of 
order  n+l  obtaining  the  32.42  .....  (raH-2)2  mutually  exclusive  closed 
squares  of  class  n+l  each  having  side-length  l/[32.42  .....  (n  +  2)2].  Each 
of  the  closed  squares  of  class  n  +  1  will  be  denoted  by  /?n+1  and  the  union  of 
the  32  .  42  .....  (n  +  2)2  squares  by  JBW+1. 

Consider  the  setf  B  =  B1  .  B%  .....  The  setf  B  is  perfect,  and  since 
B19  B2,  ...  form  a  decreasing  sequence  of  closed  sets,  B  is  also  non-empty. 
Define  the  set  Fcn  =  g  [x  =  c;  (x,  y)  e  Bn],  where  0  ^  c  ^  1  (n  =  1,  2,  .  .  .).  It 

(x.ti) 

may  readily  be  seen  that  the  set  Fc  —  Fcl.Fc2.  ...  is  non-empty.  However, 
the  set  Fc  is  just  the  intersection  of  the  set  B  with  the  line  $  [x  =  c],  and 

(x,y) 

so  the  projection  Bx  of  the  set  B  on  the  #-axis  completely  fills  the  unit 
interval.  From  the§  inequality  L[B]  ^  L[BX]  —  1,  it  follows  that  the  set  B 
is  not  of  measure  zero.  Let  sn  be  the  side-length  of  any  square  fin  of  class  n. 
Then  the  sum  of  the  diameters  of  the  32  .  42  .....  (n  +  1  )2  squares  of  class  n  is 
equal  to  2*.  Furthermore,  lim  2*sn  =  0.  From  these  remarks  we  see  that  a 

n->ao 

sequence  of  convex  sets  Ult  Uz,  .  .  .  may  be  chosen  which  satisfies  the  following 
conditions  : 

(a)  ZU^B, 
i~i 

(b)  cZ(L^)^/)  for  *=  1,2,...,  where  p  is  some   preassigned   positive 
number, 

(c)  S«*(ff«)  -  2*. 


Such  a  sequence  is  formed,  for  instance,  by  the  squares  of  Bn  for  n  satis- 
fying the  inequality  2*sn^p.  Thus  Lp[B]^2*  for  each  positive  p  and  so 


f  Note  that  J5X  =  F. 

J  This  set  was  invented  by  Gross  (see  (5),  434). 

§  For  a  plane  set  A,  L*[A\^L*{A^\t  where  the  set  A1  is  the  projection  of  A  on 
a  line  I  in  the  plane. 
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We  are  now  in  a  position  to  show  that  D(  It,  p)  =  0  for  all  points  ^>  belonging 
to  the  set  B  and  so  consequently  B  will  be  an  irregular  set.  Let  fin  and  /?„  be 
two  squares  of  class  n  such  that  d(/3nifi'n)  does  not  exceed  the  distance 
between  any  two  other  squares  of  class  n.  A  simple  geometric  investigation 
will  show  that  d(/?w,/%)  =  nsn  for  n  ^  4.  A  point  pe  B  will  belong  to  some 
square  fin  of  class  n.  It  is  now  easy  to  see  that  the  circle  c(p,  nsn)  includes  only 
the  part  of  B  in  the  square  /?n,  and  so 

L[B.c(p,mJ]      L[B.pn}      2* 


Since  lim  nsn  =  0,  it  follows  at  once  that  D(B,p)  =  0. 

n-x» 

We  now  show  that  the  set  of  distances  between  the  points  of  the  irregular 
set  B  does  not  include  an  interval  about  the  origin.  For  a,  an  arbitrary 
positive  number,  define  the  interval  Ia  —  <£  [0<a;<a].  Choose  n^4  such 

that  nsn<a.  Then  clearly  the  interval  /  =  $  [2*  sn  <  x  <  ns^  is  contained 
in  /a.  Let  p,qeB.  Ifp  and  q  belong  to  a  single  square  fin  of  class  n,  clearly 
d(p,  q)^2*sn.  If  pzfin  and  q  belongs  to  any  other  square  of  class  n,  the 
above  remarks  show  that  d(p,  q)  ^  nsn.  This  shows  that  the  set  of  distances 
between  the  points  of  B  will  not  fill  up  the  interval  /  and  so  certainly  not 
the  interval  Ia.  This  establishes  the  required  result. 

5.  We  now  introduce  an  irregular  set  possessing  the  property  that  the  set 
of  distances  between  the  points  of  the  set  actually  includes  an  interval  with 
the  origin  as  an  end-point.  Let  F  be  the  set  of  points  lying  inside  and  on  the 
boundary  of  an  equilateral  triangle.  For  simplicity  let  one  side  of  the  tri- 
angle coincide  with  the  unit  interval  along  the  #-axis.  Construct  on  F  three 
closed  triangles  of  side-length  1/3,  each  one  having  two  sides  and  a  vertex  on 
the  boundary  of  the  original  triangle  F.  It  will  be  said  of  the  three  mutually 
exclusive,  closed  triangles  that  they  belong  to  class  1  .  Each  closed  triangle 
of  class  1  will  be  denoted  by  /?x  and  the  union  of  the  three  triangles  ft^  by  B^. 
On  each  triangle  /?j  of  class  1  construct  three  closed  equilateral  triangles  of 
side-length  1/32,  each  one  having  two  sides  and  a  vertex  on  the  boundary  of 
the  original  triangle  /?t.  It  will  be  said  of  the  resulting  32  mutually  exclusive 
triangles  that  they  belong  to  class  2.  Each  triangle  of  class  2  will  be  denoted 
by  /?2  and  the  union  of  the  32  sets  y#2  by  -^2-  Continuing  in  this  manner,  we 
arrive  at  the  sequence  of  setsf  {Bn}.  Writing  B  =  Bl  .  B2  .  .  .  .  ,  we  see  that  B 
is  closed  and  non-empty. 

t  The  set  B  was  first  invented  by  Sierpinski  (see  (7),  184). 
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The  orthogonal  projection  Bx  of  B  on  the  #-axis  is  the  unit  interval.  It 
follows  at  once  that  L[B]  ;>  1.  It  may  be  noted  that   2   d(fin)  =  1,  and  so 

J?»el?n 

Bn,  for  sufficiently  large  n,  represents  a  sequence  of  convex  sets  Ult  U2,  ... 
such  that 


(6)  d(Ui)<p  (i  —  1,2,  ...),  where  p  is  a  preassigned  positive  number, 


Thus  £,[£]  ^  1  and  so  L[E\  =  1. 

We  now  show  that  the  set  B  is  irregular.  An  arbitrary  point  p  €  B  will 
belong  to  some  triangle  fin  of  class  ?&.  It  is  easily  verified  that 


(l/3n  is  the  side-length  of  the  triangle  /?„).  From  this  it  follows  that 


Consequently  D( 

To  prove  that  the  set  of  distances  between  the  points  of  B  includes  an 
interval  about  the  origin,  merely  observe  that  the  Cantor  perfect  set  in  the 
unit  interval  about  the  ic-axis  is  a  subset  of  B.  It  has  already  been  established 
(see  (4))  that  the  set  of  distances  between  the  points  of  the  Cantor  set  along 
a  line  fills  up  an  interval  including  the  origin  as  an  end-point. 

6.  Referring  back  to  §  3,  it  may  be  noted  that  the  Gross  set  is  irregular, 
possessing  inferior  density  zero  at  each  one  of  its  points.  Moreover,  the  set 
of  distances  between  the  points  of  the  set  does  not  fill  up  an  interval  about 
the  origin.  The  question  now  arises  as  to  whether  positive  inferior  density 
at  almost  all  points  of  a  set  assures  that  the  set  of  distances  between  the 
points  of  the  set  actually  fills  up  an  interval  including  the  origin  as  an  end- 
point.  The  following  example  due  to  Besicovitch  (see  (5),  431)  is  an  answer 
to  this  question. 

Let  the  set  S  be  the  interior  of  a  circle  of  radius  r.  Define  an  operation  of 
order  n  with  respect  to  the  circle  S  as  follows.  Construct  the  concentric 
circle  of  radius  (1  —  l/n)r,  divide  its  circumference  into  n  equal  parts  and 
draw  circles  of  radius  r/n  about  the  points  of  subdivision.  The  results  of  the 
operation  of  order  n  is  n  mutually  exclusive,  open  circles,  each  of  radius 
r/n  and  each  lying  interior  to  the  open  circle  S. 

We  now  introduce  the  following  set  F.  Let  B±  be  an  open  circle  of  radius  1  . 
It  will  be  said  of  this  circle  that  it  belongs  to  class  1  .  Perform  the  operation 
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of  order  4  on  the  set  Bt  obtaining  4  mutually  exclusive,  open  circles,  each 
a  subset  of  set  B±  and  having  radius  J.  It  will  be  said  of  the  four  open  circles 
that  they  belong  to  class  4.  Each  of  the  open  circles  of  class  4  will  be  denoted 
by  /?4  and  the  union  of  the  four  sets  /?4  by  B±.  On  each  circle  of  class  4  per- 
form the  operation  of  order  5  obtaining  4  .  5  mutually  exclusive  open  circles 
of  class  5,  each  of  which  is  a  subset  of  some  circle  of  class  4  and  has  radius 
r&  =  l/(4.5).  Furthermore,  perform  this  operation  so  that  none  of  the 
circles  of  class  5  are  tangent  to  the  original  circle  B±.  In  general,  on  each 
circle  of  class  n—  1  perform  the  operation  of  order  n,  obtaining  4.6  .....  n 
mutually  exclusive,  open  circles  of  class  n,  each  contained  in  a  circle  of 
class  n—l  and  having  as  its  radius  rn  =  l/(4.5.  ...  .n).  Each  of  the  circles 
of  class  n  will  be  denoted  by  ftn  and  the  union  of  the  4.5  .....  n  sets  /?n 
by  Bn.  Perform  this  operation  so  that  none  of  the  circles  of  class  n  are 
tangent  to  a  circle  of  class  n  —  2. 

Consider  the  intersection  F  —  Bl.Bi.B5  .....  Let  /a  again  re/present  an 
interval  about  the  origin  with  right-hand  end-point  a.  Choose  an  integer 
k  ^  5  such  that 

(a)  rk<%a, 

(b)  (fc-l)sm(7r/&)-l>2. 

This  is  possible  since  limrw  —  0,  and  lim{(^—  1  )  sin  (TT/TZ,)  —  1}  =  TT—  1. 

n-*-oo  n-><x> 

Suppose  p  and  q  to  be  any  two  points  belonging  to  the  set  F.  If  p  and  q 
both  belong  to  a  circle  /?fc  of  class  k,  clearly  d(p,q)^  2rk.  If  p  and  q  belong 
respectively  to  different  circles  j$k  and  ft'k  of  class  k,  both  of  which  are  con- 
tained in  a  single  circle  flk_lt  geometric  considerations  show  that 

2(rfc-i  -  rk)  sin  (n/k)  ~  2rk  ^  d(P>  £)• 
However,  using  the  definition  of  rk  and  (6),  it  follows  that 

2(rfc_i  -  rk)  sin  (TT/&)  -  2rk  =  2rk{(k  -  1  )  sin  (n/k)  -l}>  4rk. 

Finally,  if  p  belongs  to  a  circle  pk  of  class  k  contained  in  a  circle  /^fc_L  of  class 
k—I,  and  if  q  is  outside  /?#_!,  it  may  be  verified,  since  k  ^  5,  that 


From  (a)  it  is  readily  seen  that  the  interval  /  =  $  [2rk  <x<  4rJ  is  within 

the  interval  Ia.  The  above  considerations  then  show  that  the  set  of  distances 
between  the  points  of  F  will  not  include  the  interval  /.  Since  7a  is  an 
arbitrary  interval  about  the  origin,  we  conclude  that  the  set  of  distances 
between  the  points  of  the  set  F  does  not  include  an  interval  about  the 
origin. 
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Besicovitch  (see  (5),  434)  proved  that  the  upper  density  of  F  is  equal  to 
J  at  every  point  of  .F.  Furthermore,  he  proved  that  the  lower  density  of  F 

is  equal  to    .      2        —  at  almost  all  points  of  F.  Thus  we  have  an  example 

of  an  irregular  set  where  the  lower  density  is  positive  almost  everywhere 
and  where  the  set  of  distances  between  the  points  of  the  set  does  not  fill  up 
an  interval  about  the  origin. 

7.  Finally,  we  shall  give  an  example  of  an  irregular  set  such  that  the  set 
of  distances  between  the  points  will  fill  up  an  interval,  and  yet  will  not 
include  an  interval  about  the  origin. 

Consider  a  Cartesian  coordinate  system  C  and  a  polar  coordinate  system 
P  set  up  in  the  plane.  Let  T  be  a  transformation  which  takes  a  point  (x,  y) 
referred  to  C  into  a  point  [1  +  a?,  y]f  referred  to  P.  We  shall  now  use  the  Gross 
set  B  (as  defined  in  §  3)  in  the  system  C.  Take  the  T-transform  of  B  together 
with  the  origin  in  P  and  denote  this  set  by  TB.  It  will  be  shown  that  the  set 
TB  possesses  the  desired  properties. 

First  of  all,  the  set  TB  is  closed.  For,  let 

(xn,yn)<=Bn     (n  =  1,2,  ...). 


It  is  then  easily  verified  that 

*n)Bin»i(y1.-tf)}  =  0, 


from  which  it  follows  immediately  that 

lim  (1+  xn  -  r)  ==  0     and     lim  (yn  -  6)  =  0. 

n—  >«j  n->oo 

Since  B  is  closed,  a  point  (x,  y)zB  exists  such  that  lim  xn  —  x  and  lim  yn  =  y. 

7l->00  n->00 

Clearly  r  —  1  +  x  and  0  —  y,  and  so  [r,  6]  =  e  TB. 

Next,  we  shall  make  an  investigation  of  the  lower  density  at  each  point 
of  TB.  Let  8n  be  the  T-transform  of  the  square  J3n  used  in  the  definition  of 
the  set  B.  Following  the  procedure  in  §  3,  we  shall  say  that  the  sets  dn  belong 
to  class  n,  and  their  union  will  be  denoted  by  the  symbol  Aw.  A  simple 
verification  shows  that  TB  —  Ax  .  A2  —  ,  and  so  TB  is  a  closed  linearly  measur- 
able set.  Furthermore,  it  follows  at  once  that  L[TB]  ^  1,  and  so  TB  is  non- 
empty. 

For  n  =  3,  4,  ...,  define 


Pn 


f  The  symbol  (#,  y)  will  represent  Cartesian  coordinates.,  while  [#,  y]  will  denote 
the  polar  coordinates  of  a  point. 
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The  lowest  vertex  nearest  the  origin  and  the  highest  vertex,  both  of  the 
set  8n,  will  have  respectively  polar  coordinates 

t1  +Pn+l>  SWJ      and      C1  +Pn+l  +  *n+l>  0n+l  +  <»+J- 

A  simple  computation  then  shows  that 

d(Sn)  =  {4+1  +  4(l+j>n+1)(l+pn+1  +  <re+1)sinH«m+1}*. 

From  this  equation  it  follows  that  for  every  #n€An  the  inequality 
d(dn)  ^  5*  tn+l  holds.  However,  since  there  are  32.42  .....  (n+  I)2  sets  dn 
belonging  to  An,  and  since  Km  tn  =  0,  for  an  arbitrary  p>  0  there  exists 

n->oo 

a  sequence  of  open  sets  Ult  U2,  .  .  .  such  that 

(a)  ^U^Ts,         (b)  d(Ui)<p     (i  =  1,2,...),         (c)  2^(^)^5*. 
t-i  i-i 

This  shows  at  once  that  Lp[TD}  ^  5*,  and  so  L\TB~\  ^  5*. 

It  is  now  observed  that  an  N  >  0  exists  such  that,  for  n^N,  the  relation 

(n-l)mmd(Sn)  =  (n-l){4- 


holds.  For  n^N,  consider  two  sets  3n  and  <%,  of  class  n,  such  that  d(<5n,  <5^) 
does  not  exceed  the  distance  between  any  two  other  sets  of  class  n.  Geo- 
metric considerations  show  that,  for  every  #neAn,  there  exists  a  #*eAn 
such  that  d($n,  §„)  —  ntn+1,  and,  furthermore,  that  either  d($n,  8'n)  equals 
ntn+1  or  is  as  great  as  (n  —  1)  min  d(Sn).  The  previous  inequality  rules  out  the 

*n^n 

second  case  and  so  d(Sn,  8'n)  =  ntn+1.  Let  p  €  TB)  in  which  case  p  will  belong 
to  some  8n  of  class  n.  If  n  ^  max  (3,  N),  certainly  ntn+1  ^  &tn+l  ^  maxd(#n). 
Thus  *«€J» 

L[rB.c(3>>ne>+1)]==L[yB.an]    g* 

2n«n+1  2ntn+l    ~  2n> 

and  lini  [5*/2n]  =  0.  This  proves  that  D(TB,p)  —  0  everywhere,  and  so  the 

n—  >-oo 

set  TB  is  irregular. 

To  show  that  the  set  of  distances  between  the  points  of  TB  does  not  include 
an  interval  about  the  origin,  for  an  arbitrary  a  >  0  consider  the  interval  Ia 
as  defined  in  §  3.  Choose  n^N  such  that  max  d(dn)  <  ntn+l  <  a.  Let  p,  q  €  TB. 

&n*An 

If  p,  q  €  dn  of  class  n,  then  clearly  rf(  £>,  q)  ^  max  <^(#n).  If  p  €  $n  and  <£  belongs 

*»e<d» 

to  any  other  set  of  class  n,  then  d(p,  q)  ^  w£n+i.  The  interval 
I  =  &  [max  d(#n)  <  x  <  ntn+l] 

X     *»€/!„ 

is  clearly  contained  in  Ia.  However,  from  the  above  consideration,  the  set 
of  distances  between  the  points  of  TB  will  not  include  the  interval  /.  This 
establishes  the  desired  result. 
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We  finally  show  that  the  set  of  distances  between  the  points  of  TB  will 
fill  up  the  interval  J  =  <sf  [1  <  a;  <  2].  Consider  an  arbitrary  aeJ.  Obviously 

d(a,  0)  =  a.  Since  the  projection  of  the  set  B  on  the  #-axis  fills  up  the  unit 
interval,  there  exists  a  point  (x,y) e B  such  that  x  =  a—I.  Thus  a  =  x+  1 
and  so  the  point  (a,  y)  actually  belongs  to  the  set  TB.  Therefore,  the  distance 
a  is  actually  attained  by  two  points  of  TB  which  proves  the  desired  result. 
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1  .    Introduction 

A  Riemannian  Vn  (n>2)  is  called  simply  harmonic  (Walker,  3,  25)*  if, 
for  every  pair  of  points  (x1),  (XQ),  Hadamard's  "elementary"  solution 
(Hadamard,  11)  of  the  generalized  Laplace  equation  A2F  =  0  reduces  to 
l/sn~2,  where  5  is  the  geodesic  distance  between  the  two  points.  For  n  —  2 
or  3  a  simply  harmonic  space  is  flat,  and  the  elementary  solution  reduces 
to  logs  or  1/5  respectively,  or,  in  a  more  familiar  notation,  to  logr  or  1/r. 
A  simply  harmonic  Vn9  for  any  n,  is,  incidentally,  an  Einstein  space  of  zero 
scalar  curvature,  the  Ricci  tensor  being  everywhere  zero. 

In  a  recent  paper  (Ruse,  10)  I  demonstrated  the  existence  of  simply 
harmonic  spaces  for  which  the  curvature  tensor  satisfies  the  relation 


for  some  vector-field  tcp,  the  left-hand  side  being  the  covariant  derivative 
of  jRw.  It  was  shown  that  a  Vn  for  which  (1-1)  holds  is  simply  harmonic  if, 
and  only  if,  all  the  latent  roots  of  the  matrix  F  =  [Fj],  where 

rj  =  R^rvr*,  (1-2) 

are  zero  for  all  points  (x1)  and  for  all  vectors  rl  thereat.  (In  the  earlier  paper 
the  arbitrary  vector  r*  was  denoted  by  /V.) 

It  was  also  shown  that  every  V2  satisfies  a  relation  of  type  (!•!),  with 


K  being  the  Gaussian  curvature.  The  curvature  or  other  geometrical  pro- 
perties of  spaces  Vn  (n>  2)  satisfying  (!•!)  have  not  been  fully  investigated 
so  far  as  I  know,  except  for  the  case  Kp  =  0,  when  they  are  symmetric  in  the 


*  An  approximately  chronological  list  of  earlier  papers  on  harmonic  spaces  appears 
at  the  end  of  this  paper. 
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sense  of  Cartan.  Pending  such  an  investigation,  I  call  any  Vn  satisfying 
(1-1)  with  ^=4=0,  whether  simply  harmonic  or  not,  by  the  unsatisfactory 
but  geometrically  non-committal  name  of  Jcappa-space,  and  denote  it  by 
Kn.  (1*1)  will  be  called  the  kappa-condition. 

This  paper  is  confined  to  the  case  n  =  4  because  this  case  may  be  dealt 
with  by  the  use  of  geometrical  methods  developed  in  earlier  papers.  The 
problem  of  determining  the  general  nature  of  simply  or  completely  harmonic 
spaces  for  n  >  4  remains  unsolved,  though  it  is  known  that  in  certain  cases 
they  are  flat  or  of  constant  curvature. 

In  §  2  I  obtain  the  most  general  form  for  the  curvature  tensor  in  any  J£4. 
In  subsequent  sections  attention  is  confined  to  the  case  when  the  K±  is 
simply  harmonic.  The  final  conclusion  is  that  the  metric  of  every  such  space 
is  reducible  to  the  form 

ds2  =  adx*  +  Zydxdy  +  fidy*  +  2dxdz  +  Zdydt, 

where  a,  ft,  y  are  functions  of  x,  y  only;  and,  conversely,  that  every  space 
having  a  metric  of  this  type  is  a  simply  harmonic  jfiT4. 

2.    The  Riemann  tensor  in  a  K± 

Let  g^  be  the  fundamental  tensor,  g  its  determinant  and  eijkl,  eijkl  the 
dualizing  tensors  of  non-zero  components  *Jg  and  l/^Jg  respectively.  If  g  <  0, 
the  latter  tensors  are  imaginary.  The  K^  is  not  at  present  assumed  to  be 
simply  harmonic,  nor  even  an  Einstein  space:  it  is  any  V±  satisfying  the 
kappa-condition. 

The  dual  of  any  bi vector  bti  ( =  -  b^)  will  be  denoted  by  °bij.  Thus 

°6«  -  ie*?fcV  (2-1) 

The  dual  of  the  Riemann  tensor  is  defined  by 

ojgtffci  „  Je^paeWr-JKp^.  (2-2) 

If  by,  cif  are  any  two  bivectors,  then  \^klb^cMt  which  is  equal  to  °bklckl 
and  to  °cijbii,  will  be  denoted  by  (°bc)  or  (°cb).  A  bivector  p^  is  simple  if 
(°PP)  =  0,  a  relation  that  may  also  be  written 

PwPu+PaiPu +Pi2Pu  =  0.  (2-3) 

In  the  protective  space  S3  associated  with  any  given  point  (x*)  of  J£4, 
the  components  p^  of  a  simple  bivector  are  Pliicker  coordinates  of  a  line, 
or,  otherwise  interpreted,  of  a  special  linear  complex.  The  components  of 
Op*i  are  dual  coordinates  of  the  same  line  or  complex.  The  equation 

*««vy=o 

is  that  of  a  quadratic  complex  of  lines,  the  Riemann  complex  (Ruse,  14). 
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The  kappa-condition  Rijia,p  ~ 

combined  with  Bianchi's  identity,  gives 

RmnptKr  +  RmuQrKp  +  RmnrpKt  =  0,  (2-5) 

which  may  also  be  written 


0. 
Multiplying  this  by  \eijmn  and  noting  that  ekml  =  €*&><*,  we  get 

°Bt^Kl  -  0.  (2-6) 

In  the  projective  space  $3,  °RiiM  are  dual  coordinates  of  the  Riemann 
complex  and  Kt  the  coordinates  of  a  plane  (of  equation  KtXl—  0  in  current 
point-coordinates  X1).  Let  qi  be  any  plane  other  than  /q.  Then 

Ctfs^RWfyft.  (2-7) 

are  the  coordinates  of  the  complex  conic-envelope  of  the  plane  qit  the 
equation  of  the  envelope  being  C^w^w^  ~  0  in  current  plane  coordinates 
wt.  By  (2-6)  and  (2-7),  ^  =  Q 

and  so  the  conic  C^  lies  in  the  plane  Kt.  It  is  already  known  to  lie  in  qi9 
which  is  a  different  plane,  and  so  it  must  consist  of  a  pair  of  points  (not 
necessarily  distinct)  on  the  line  of  intersection  of  the  planes.  Thus  the 
complex  conic-envelope  of  any  plane  q$  is  a  point-pair  in  the  plane  Kt.  The 
quadratic  complex  must  therefore  consist*  of  all  the  lines  meeting  a  fixed 
conic  in  the  plane  *q.  This  will  be  called  the  base-conic. 

Assume  for  the  moment  that  the  base-conic  is  non-degenerate,  and  let 
r*  be  any  point  of  $3  not  in  the  plane  Kit  so  that  K^r1  =J=  0.  Let  S^  =  Ripq^pT^ 
be  its  complex  cone  —  the  cone  having  r1  as  vertex  and  passing  through  the 
base-conic.  Sq  may  be  identified,  if  desired,  with  the  F^  of  (1-2).  Then  it 
follows  quickly  by  use  of  (2-5)  that 


Therefore,  if  the  components  of  r*  are  normalized  so  as  to  make  (r^)2  =  —  1, 

We      aVe  RhVk  =  8uKtKk  +  StkKhKi-8hkKiKi-8ijKhKk  (2-8) 

(cf.  Ruse,  13,  (2-8)). 

*  It  can  be  proved  independently  that,  when  the  Kiemann  tensor  satisfies  (2-6)  for 
some  K  ^  then  it  also  satisfies  the  identities  given  in  Ruse  (13,  §  4),  for  the  coordinates 
of  a  conic  defined  by  the  quadratic  complex  of  lines  meeting  it.  In  the  latter  paper 
pairs  of  skew  suffixes  are  raised  by  means  of  the  e-tensor  without  the  distinguishing 
mark  °. 
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Let  lit  mt,  T^  be  three  planes  through  the  vertex  r*  of  the  cone  $y,  forming 
a  self-polar  trihedral  with  respect  to  the  cone.  Then 

##  =  lilj  +  mimj  +  niiip  (2-9) 

if  lit  nifr  nt  be  suitably  normalized  :  for  the  equation  of  the  cone  is  8y  X*Xj  =  0, 
and  so,  if  we  change  the  tetrahedron  of  reference  to  that  formed  by  the 
planes  lit  miy  nt  and  any  fourth  plane,  say  /q  itself,  we  get  new  coordinates 
'&,  where  |i  =  W  g»  -  «,Z<,  g»  =  «»Z*,  f«  -  «,Z«, 

in  terms  of  which  the  equation  of  the  cone  reduces  to  (g1)2  -f  (g2)8  4-  (£3)2  =  0, 
Equation  (2-9)  follows  at  once. 

Substituting  from  (2»9)  in  (2-8),  we  get 


where      lhi  =  l^  -  Z4icfcl     mM  =  ra^/q  -  w4  jcfc,     WM 

Thus  Zi^)  miit  nif  are  Pliicker  coordinates  of  three  lines  in  the  plane  /cz-  of  the 
base-conic,  forming  a  triangle  self-polar  with  respect  to  the  conic.  Since 
they  are  lines,  we  have 

(°K)  -  0  ==  (°mm)  =  (°nri)  (2-11) 

in  the  notation  of  §  2,  and  since  they  intersect, 

(°mn)  =  0  =  (°nl)  =  (°lm).  (2-12) 

Further,  as  they  all  lie  in  the  plane  Kit  we  have 

0ZtfK,  =  0  =  °mVKj  ==  Ori%.  (2-13) 

It  has  therefore  been  shown  that  the  most  general  form  for  the  Riemann 
tensor  in  a  K±  is  that  given  by  (2-  10),  where  the  simple  bivectors  lijt  m^,  nif 
satisfy  (2«11),  (2-12),  (2-13).  The  bivectors  are  not  unique  because  they  may 
represent  the  sides  of  any  triangle  self  -polar  with  respect  to  the  base-conic. 

Other  equivalent  forms  for  RMjk  may  be  found  by  taking,  instead  of  a 
self-polar  triangle,  some  other  triangle  specially  related  to  the  conic,  for 
example  one  consisting  of  a  pair  of  tangents  and  the  chord  of  contact 
(cf.  Ruse,  13,  (3*34)). 

Hitherto  the  base-conic  has  been  assumed  non-degenerate.  If  it  consists 
of  a  pair  of  distinct  lines,  the  Riemann  complex  then  consisting  of  the  pair 
of  special  linear  complexes  having  these  lines  as  directrices,  the  curvature 
tensor  is  reducible  to  the  form 


And  lastly,  if  the  conic  consists  of  a  repeated  line,  the  Riemann  complex 
then  being  a  linear  complex  taken  twice,  the  tensor  assumes  the  form 
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3.    The  Eiemann  tensor  in  a  simply  harmonic  K4 

If  the  KI  is  simply  harmonic,  it  is  among  other  things  an  Einstein  space 
forwhich  ^  =  0,  (3-1) 

and  the  Riemann  complex  in  S3  is  self-polar  with  respect  to  the  fundamental 
quadric  g^  (Ruse,  14,  71). 

Now  a  system  of  lines  meeting  a  conic  cannot  be  self-polar  with  respect 
to  a  quadric  unless  the  conic  either:  (a)  consists  of  a  pair  of  lines  which  are 
either  generators  of  the  quadric  (of  opposite  systems)  or  else  conjugate  tan- 
gents; or  (b)  consists  of  a  repeated  line  which  is  a  generator  of  the  quadric. 

It  has  been  shown  elsewhere  (Ruse,  6,  161)  that  case  (a)  is  impossible  in 
a  non-flat  completely  harmonic  space,  and  hence,  in  particular,  in  a  simply 
harmonic  space.  We  therefore  conclude  that  the  Riemann  tensor  must 
have  the  form  (2*15),  where  m^  represents  a  generator  of  the  fundamental 
quadric  and  is  therefore  such  that 

(°mm)  =  0  (3-2) 

and  °m^  =  ±  mir  (3-3) 

Equation  (3*2)  expresses  the  fact  that  m^  represents  a  line  of  S3  (simple 
bivector  of  K4)t  and  (3-3)  that  it  is  a  generator  of  one  or  other  system  (self- 
dual  or  anti-self-dual  bivector  of  -fiT4).  Thus 


where  there  is  now  no  distinction,  except  perhaps  for  sign,  between 
and  its  covariant  dual,  and  none  at  all  between  R^M  and  its  dual;  thus 


Because  of  these  facts,  equation  (2-6)  now  gives 

mMKf  =  0,  (3-6) 

which  states  that,  in  S3,  the  plane  Kt  contains  the  generator  m{j.  Therefore 
it  touches  the  quadric,  and  so 

0*lKtK,  =  0.  (3-7) 

Thus  Kt  is  a  null  vector  of  K±. 

Substituting  from  (3-4)  in  the  kappa-equation  (2-4),  we  obtain 


If  this  is  multiplied  by  fikl,  where  f*,kl  is  an  arbitrary  contravariant  bivector, 
W6get  rnijtp^vpmijt  (3-9) 

where  vp  =  KP  -  (m^^^Km^8). 

SUB.  2.    VOL.  50.     NO.  2399.  V 
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Substitution  of  (3-9)  in  (3»8)  at  once  gives  vp  =  ^Kpt  and  so 


(3*10) 

Now  by  the  permutation-formula  for  covariant  derivatives, 


sO, 
because  of  the  skew  symmetry  ofm^.  Thus 


But  by  (3-10), 

=  4(^,a  +  iACpAcff)mw, 

whence,  by  (3-11),  /cp  tf  =  Kq>p, 

that  ifl  ^     d*«  -  0 

thatl8j  8^"~a^~°- 

KP  is  therefore  the  gradient  of  a  scalar  AT,  and  we  have* 

KP  =  dK/daP.  (3-12) 

Equation  (3*7)  now  becomes     AJ/C  =  0.  (3*13) 

Now  differentiate  (2-5)  covariantly  with  respect  to  af.  In  the  equation 
so  obtained,  the  terms  involving  the  covariant  derivative  of  the  Riemann 
tensor  vanish  because  of  the  kappa-condition  and  because  of  (2«5)  itself, 
and  we  are  left  with 

RmmK.r.a  +  RmnvK.p.a  +  RmvTpK.Q.a  =  °-  (3*14) 

Multiply  this  by  g™1,  summing  for  nt  q.  The  first  two  terms  vanish  because 
the  Ricci  tensor  is  zero,  and  so  we  obtain 

Rm«n,«n,,  =  0-  (3-15) 

Now  multiply  (3-14)  by  (f*.  Then,  because  of  the  symmetry  properties  of 
the  Riemann  tensor  in  its  suffixes  and  also  because  KtPt8  is  symmetric  in 
p,  s,  we  get,  using  (3-15),  R  ra  _Q 

^mnpq'U    K,rta  ~~  v» 

whence  A2/c  =  0.  (3-16) 

Both  differential  parameters  formed  from  /c  are  therefore  zero. 

*  Consistency  of  notation  now  really  requires  K9  to  be  denoted  by  Kt  „,  but  as  a  rule 
the  comma  will  be  omitted. 
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4.    Fields  of  parallel  vectors  in  a  simply  harmonic  K^ 
A  set  of  functions  A*  defines  a  parallel  vector  field  if 

A*,,  =  0.  (4-1) 

The  conditions  of  integrability  of  these  equations  are 

.       JV  =  0,     1^PA'-0,     Ifo^A' -  0.     ...  (4-2) 

(cf.  Eisenhart,  12,  67).  If  the  first  of  these  is  satisfied,  then,  in  a  JjT4,  all  the 
subsequent  conditions  are  satisfied  because  of  the  kappa-condition, 
covariant  differentiation  of  which  gives  R^^p.q  =  (Ktptq  +  KpKq)Ryiu>  an(^ 
so  on.  Now  by  (3-4),  equation  (4-2)  is  equivalent  to 

mk,X  =  0.  (4-3) 

In  the  protective  space  $3,  this  means  that  A*  is  a  point  on  the  generator 
m^  of  the  fundamental  quadric.  Therefore  (4-3),  and  hence  (4-2),  admits 
two  distinct  solutions,  say  A*  ==(AJ,  A|)  (cr  —  1,2),  since  a  straight  line  is 
defined  by  any  two  points  upon  it.  Any  other  vector  satisfying  (4-3)  is  a 
linear  combination  ^A*.  of  the  A£,  01  and  0a  being  scalars.  Hence,  as  in 
Eisenhart,  loc.  cit.  68,  69,  with  his  p  equal  to  2,  it  follows  that  K±  admits 
fields  of  parallel  vectors  depending  upon  two  arbitrary  constants.  In 
other  words,  we  can  find  two  independent  vectors  A*,  /£*',  both  satisfying 
(4-1),  and  hence  also  (4-2)  and  (4-3),  such  that  any  other  vector  satisfying 
(4'1)  is  of  the  form  «Ai  +  6^,  where  a,  b  are  constants. 

Because  A*,  /i1  both  satisfy  (4-3),  they  represent  points  in  S3  that  lie  on 
the  same  generator  of  the  fundamental  quadric.  Therefore  they  lie  on  the 
quadric  and  are  conjugate  with  respect  to  it.  Hence 

0WA«A»  -  o  -  <7f^V  =  ?«AV.  (4-4) 

Thus  A*  and  /**  are  both  null  vectors  in  K4  and  are  perpendicular  to  one 
another. 

By  (2-6),  (3-5)  and  (3-4),  the  vector  /c*  also  satisfies  (4-3).  Therefore  it,  too, 
represents  in  S3  a  point  on  the  generator  m^.  Hence  it  is  expressible  in  the 
form  **  =  o-A1'  +  Tfi\  (4-6) 

Here,  however,  it  is  to  be  presumed  that  cr  and  r  are  not  constants,  but  are 
scalar  functions  of  the  xiy  since  there  is  no  reason  for  supposing  that  the 
covariant  derivative  of  AC*  is  zero.  It  turns  out,  indeed,  that  it  is  in  general 
not  so. 

5.    Special  coordinate-systems 

Again  following  the  argument  given  by  Eisenhart  (this  time  from  the 
middle  of  his  p.  69  to  the  middle  of  p.  70),  we  can  deduce  that  it  is  possible 
to  choose  a  coordinate-system  (x*)  in  the  simply  harmonic  K±  so  that  all 

V2 
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components  of  A*  are  zero  except  one,  say  A3,  and  also  so  that  all  com- 
ponents of  /**  are  zero  except  one,  say  /**.  Writing  py  q  for  A3  and  /*4,  we  have, 
in  that  coordinate-system,  ^  _ 


From  (4»4)  it  at  once  follows  that 

033  =  0  =  044  =  034»  (5*3) 

and  hence  that  g11  =  0  =  g™  =  012.  (5-4) 

By  (4-5),  (5-1)  and  (5-2),  AT1  =  0  =  K2, 

and  hence,  by  (5*3),  /cg  =  0  =  /c4.  (5-6) 

The  last  equations  show  that,  in  the  special  coordinate-system,  the  scalar 
K  is  a  function  of  x1,  x2  only. 

Moreover,  since  A*  and/**  both  satisfy  (4-2),  we  have,  by  (5-1)  and  (5-2), 


Because  of  the  symmetry  properties  of  R^M  in  its  suffixes,  it  follows  that 
any  component  of  the  Riemann  tensor  having  one  or  more  of  its  suffixes 
equal  to  3  or  4  is  zero.  Hence  the  only*  non-zero  component  is  #12i2- 

The  co  variant  derivatives  of  A*  and  ^  being  zero,  we  have,  from  (5-1) 
and  (5-2), 


Taking  i  =  1,  2,  4  in  (5-6)  and  i  =  1,  2,  3  in  (5-7),  we  get 

i- 


-- 

in  each  case  for  all  values  of/.  If  we  now  put  i  —  3  in  (5-6)  and  i  =  4  in  (5-7), 
we  obtain 


(5-11) 

\-*J)  W 

for  all  j. 

*  To  save  constant  reference  to  the  possibility  of  permuting  suffixes,  the  components 
for  which  (ijkl)  =  (2121),  (2112)  or  (1221)  will  be  regarded  as  effectively  the  same  as 
that  for  which  (ijkl)  =  (1212).  They  differ  from  it  at  most  in  sign. 
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It  will  be  convenient  hereafter  to  denote  (a;1,  #2,  a;3,  X*)  by  (x,  y>  z,  t). 
From  the  last  of  equations  (5-9),  with.?  =  3,  and  from  (5'10)  with^'  =  4, 
we  have 

Q  =  - 

and  similarly,  from  (5-8)  and  (5-11), 

O.- 

Hence  p  =  p(x,  y,  z),     q  =  q(x,  «/,  t). 

Now  transform  to  a  new  coordinate-system  (»'*)  s  (#',  yr,  z',  tf)  defined  by 


where  (f>(x,  y)  and  ijr(x,  y)  are  arbitrarily  chosen  functions  of  x,  y,  and  the 
integrals  are  taken  with  x,  y  constant.  Then  from  the  ordinary  law  of 
transformation  of  contravariant  vectors,  it  follows  at  once  that,  in  the  new 
coordinate-system,  the  vectors  A*,  ^  have  components 

A'*  =  (0,0,  1,0),     X'=(<>,0,0,1). 

Dropping  the  dashes,  we  may  conclude  that  there  exists  a  coordinate- 
system  in  the  simply  harmonic  K±  in  which  the  functions  p,  q  introduced 
in  (5-1)  and  (5-2)  are  equal  to  unity.  By  (5-10)  and  (5-11),  the  Christoffel 

symbols  j  .[  ,  {  >  are  zero  in  this  coordinate-system,  as  well  as  those  given 
in  (5-8),  (5-9).  So  we  now  have 


for  all  i,  j.  Thus  all  Christoffel  symbols  having  a  lower  index  3  or  4  are  zero. 
Equations  (5-3),  (5-4)  and  (5-5)  of  course  still  hold.  That  .R1212  is  the  only 
non-zero  component  of  the  Riemann  tensor  may  now  be  fairly  easily  verified 
by  direct  calculation. 

The  covariant  derivative  of  g^  is  zero.  Hence 

i-  (5-13) 

If  k  —  3  or  4,  the  ChristofFel  symbols  on  the  right  are  zero,  by  (5-12).  Hence 
all  the  non-zero  components  ofg^p  and  therefore  ofgij,  are  functions  ofx>  y  only. 
Consequently  -R1212  is  a  function  ofx,  y  only.  It  has  already  been  shown  that 
the  same  is  true  of  the  scalar  K. 
Now 


p  n 
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If  one  or  more  of  the  suffixes  i,j9  k,  Us  3  or  4,  this  is  identically  zero  because 
Eiild  and  the  Christoffel  symbols  are  then  zero.  Hence  the  kappa-condition 


is  satisfied  trivially,  both  sides  being  zero.  Again,  by  (5-14), 


- 

If  p  =  3  or  4,  this  is  zero  because  -B12i2  depends  on  x,  y  only,  and  because  of 
(5-12).  Hence,  inasmuch  as  ACS  =  0  =  /c4,  (5-15)  is  again  satisfied  trivially. 
There  remain  only  the  cases  in  which  p  =  1  or  2  in  (5-16).  Now 


by  (5-12),  that  is, 


Hence  by  (5-16),        5^,  =  g^^8ia-  ^1212  a^log  I  ^  I 


where 


(5-17) 


This  verifies  that  the  Riemann  tensor  does  in  fact  satisfy  the  kappa-con- 
dition. 

It  is  of  interest  to  note  that  (5-17)  bears  a  formal  resemblance  to  (1*3), 
which  holds  in  any  V2.  For  the  Gaussian  curvature  of  a  T^  is  precisely 

K  =  JK1212/gr, 
•^1212  then  being  the  only  component  of  the  Riemann  tensor. 

6.    Further  particularization  of  the  coordinate-system 

It  has  been  found  that  coordinate-systems  exist  in  which  gr33,  044,  </34, 
911,  9**>  912  are  all  zero  and  in  which  the  remaining  gr's  are  functions  of  x,  y 
only.  The  question  arises  whether  these  conditions  are  alone  sufficient  to 
ensure  that  the  space  should  be  a  simply  harmonic  K^.  The  answer  is  no, 
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because  it  is  also  necessary  that  the  Christoffel  symbols  {„.},  ( J .}  should 

I3?j    14^J 

all  be  zero  for  all  i,  j.  Most  of  these  do  in  fact  vanish  by  reason  of  the  above 
conditions,  but  not  all.  Those  not  identically  zero  are 


31/-\13/-w     \te       Sy^'  (6i) 

_  I  3  (  _         1^13/^23       °9ia\ 

32| 


41|       114,  ,  (M) 

42/  V     ,  ^  ~1"14  ,^        7". 

For  the  space  to  be  a  simply  harmonic  K±  these,  too,  must  vanish.  So  by 
(6»1)  we  must  have  ^  0 

%*-%*.  (6-3) 

dx        dy 

The  only  alternative  would  be  </23  =  0  =  gr31,  but  if  this  were  so  the  deter- 
minant |  gli  |  would  vanish  because  we  also  have  g11  =  0  =  gr22  =  g12. 
Similarly,  from  (6-2),  .  . 


But  because  <733>  gr44,  g34  are  all  zero,  the  fundamental  quadratic  form  is 


(6-5) 

By  (6-3)  and  (6-4),  g^dx-^-g^dy  and  ^efo  +  g^^/  are  b°^  exact  differ- 
entials, say  dg  and  dy.  £  is  determined  as  a  function  of  #,  y  except  for  an 
additive  constant  which  may  be  chosen  at  convenience,  and  the  same  is 
true  of  -37.  Taking  g,  ij  as  new  coordinates  in  place  of  #,  y,  remembering  that 
the  gr's  are  functions  of  xt  y  only,  we  get  ds2  in  the  form 


where  the  y's  are  functions  of  g,  17  only.  Writing  x,  y  in  place  of  g,  97  and 
a,  ft,  y  for  yu,  y22,  y12,  we  have: 

The  metric  of  any  simply  harmonic  K±  is  reducible  to  the  form 

ds*  ==adx*  +  Zydxdy  +  fidy*  +  Zdxdz  4-  2dydt9  (6-6) 

where  a,  /?,  y  are  functions  of  x,  y  only.  The  only  non-zero  component  o 
is  JR1212,  and  the  scalar  K  is  also  a  function  ofx,  y  only. 
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For  the  space  (6-6), 


EM! 


"a    y     1     0~ 
y    ft    0     1 
1000 
_0     1     0     0_ 

,  &»]  = 

"001       0  " 
000       1 

1     0   -a   -y 
0     1    -y   -/?_ 

(6-7) 


11 
4 
11 

Calculation  shows  that 


the  determinant  g  is  equal  to  unity,  and  the  only  non-zero  Christoffel 
symbols  are 


(6-8) 


(6-9) 


12 


22)      dy     2dx' 

= f* 

2l(      2dx'     122 


__ 


7. 

It  remains  only  to  show  that  every  P£  whose  metric  is  reducible  to  the 
form  (6- 6)  is  a  simply  harmonic  K^. 

That  it  is  a  K4  follows  at  once  from  the  work  of  §  5,  in  which  Kp  was 
determined  for  a  more  general  coordinate-system  than  that  of  (6*6). 

That  it  is  simply  harmonic  follows  almost  at  once.  Writing 


as  in  (1'2),  we  find  immediately  that,  whatever  r*,  all  the  Tj  are  zero  except 
FJ,  T|,  FJ  and  F|.  It  is  therefore  obvious  that  the  latent  roots  of  the  matrix 
[Fj]  are  all  zero.  Consequently  the  space  is  simply  harmonic. 

The  conclusions  of  this  paper  are  all  illustrated  by  the  example  of  a  simply 
harmonic  K±  given  in  Ruse  (10).  The  K^  in  question  was  given  by 


da2  = 


+  2dxdz  +  2rjdxdt  -f-  2xdrfdty 


where  T/  is  written  for  the  y  of  the  previous  paper,  F(q)  being  any  function 
of  rj.  If  we  put 

y  =  y\x, 

this  becomes       ds*  =  F  l^\  (ydx  -  xdy)2  +  2dxdz  -h  2dy  dt, 

\xj 

which  is  of  the  form  (6»6). 
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1.    Introduction 

The  following  theorems  concerning  the  Cesaro  summability  of  the  allied 
series  and  Fourier  series  of  a  function  f(t)  integrable  in  the  Lebesgue  sense 
are  known.*  In  stating  the  theorems  we  suppose  that  f(t)  has  period  2n 
and  is  integrable  L  in  (0,  ZTT).  We  write 

W)  =  «/(*  +  *)  -/(*  -  ')},   #(«)  =  K/<*  +  *)  +/(*  -  0), 

00 

and  denote  the  allied  series  and  Fourier  series  of  /(£)  for  t  —  x  by  2  ^nf^) 

00  '  »-l 

and  2  An(x)  respectively. 

n=0 


THEOREM  A. 

1 
(i)  lim  f(0  =  0(C,a),     (ii)  Hm  - 


then  S  ^(^J  ^5  summable  (C,fi)to  s. 

n-l 


*  See  L.  S.  Bosanquet  and  J.  M.  Hyalop  (8,  491-492),  and  the  references  there  given. 
•f  By  lim  g(t)  —  Z(C,a),  we  mean  that 


lim  —  I    (t  —  t 

t-+  +  QtaJ   0 


<2w  =  i     or      lim  g(t)  =  I, 


according  as  a>0  or  a  =  0;by  lim  g(t)  =•  l(C),  we  mean  that  lim  g(t)  =  Z((7,a)  for 

*->+o  <->-+o 

some  a  ^f  0.   In  theorems  A,  B  and  G  we  replace  the  more  usual  conditions  involving 

/*  ?^(tt)  r  T 

^—^du  by  equivalent  conditions  involving   I    fr(u)  cot  $udu;  in  this  form  the 
t     u  J  t 

theorems  can  be  more  easily  extended  to  functions  integrable  in  the  Cesaro -Perron 
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oo 

THEOREM  B.  //*  a  >ft+  1  ^  0  and  2  Bn(x)  is  summable  (C,fi)  to  s,  then 

n«l 

i  r* 

(i)  lira  ft(t)  =  0(<7,a),     (ii)  lira  -     i{r(u)  cot  %udu  =  s(C). 
t-++o  t-++onjt 


THEOREM  C.  For  £  J3n(#)  to  be  summable^  (C)  to  s,  it  is  necessary  and 
l 
lim  - 

^+O  TTJt 


sufficient  that  n=sl 


or,  what  is  equivalent,  that  i]s(t)  cot  \t  be  integrable%  (CL)  in  (0,  TT)  and 
1  T* 


-\ 

TTJ- 


THEOREM  D.  If  ft  >  a  ^  Oand  lim  $&(£)  =  s(C,  a),  Zften  S  ^nC^)  *s  summable 
(C,p)tos.  (->+0  —  ° 

oo 

THEOREM  E.  //a>/?+1^0  a7ic?  ZMJ^)  *5  summable  (C,fi)  to  s,  then 

lim  0($)  =«(C,a).  w"° 

<->+o 

00 

THEOREM  F.   For  S  ^/iC^)  ^°  ^e  summable  (C)  to  s,  it  is  necessary  and 

u=0 

sufficient  that  lim  <j>(t)  =  s  (C). 
t-++o 

These  theorems  have  been  extended  by  various  writers  to  cases  when 
f(t)  is  not  necessarily  integrable  L  in  (0,  277).  In  particular,  it  follows  from 
a  result  due  to  Bosanquet§  that  theorem  C  holds  in  the  form  stated  for 
functions  f(t)  integrable  (CL)  in  (0,  2n).  Further,  theorems  D  and  E  have 
been  extended  by  Burkill||  and  Bosanquet^J  to  functions  integrable  in  the 
Cesaro-Perron  sense  of  integral  order.  Their  results,  for  a  function  f(t) 
integrable  C^P  (A  a  positive  integer),  can  be  stated  in  the  following  form: 

oo 

THEOREM  D^  If  /#>a>A+  1  and  lim  $(t)  =  s(C,a),  then  2  An(x)  is 
summable  (O,  ft)  to  s.  '~>+0  n""° 


THEOREM  Ev  //  a>/?-t-l^A-fl  and  2  An(x)  is  summable  (C,j3)  to  s, 


*  Theorem  B  was  obtained  by  Paley  (13,  theorem  IV)  in  an  equivalent  form  in 
the  case  /?^=0;  the  case  —  1</?<0  can  be  obtained  by  adapting  a  method  used  by 
Bosanquet  (2,  theorem  4)  and  is  included  in  theorem  II  proved  below. 

f  I.e.  summable  by  Cesaro  means  of  some  order. 

J  For  the  definition  of  the  Cesaro  -Lebesgue  integral,  see  L.  S.  Bosanquet  (6). 

§   L.  S.  Bosanquet  (7,  71).  ||  See  J.  C.  Burkill  (10)  for  theorem  D!. 

•jf  See  L.  S.  Bosanquet  (4,  282)  for  theorem  Bx;  for  similar  extensions  of  theorems 
D  and  E  to  functions  integrable  in  the  general  Denjoy  sense,  see  L.  S.  Bosanquet  (2, 
theorems  2  and  4). 
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It  follows  from  these  results,  and  from  consistency  theorems,  that  theorem 
F  holds  in  the  form  stated  for  functions  integrable  (OP)  in  (0,  27r). 

In  this  paper,  we  suppose  that/(£)  is  integrable*  CAP,  where  A  may  be 
any  positive  number  (not  necessarily  integral)  or  zero,  and  obtain  extensions 
of  theorems  A,  B  and  C.  We  show,  in  fact,  that  theorem  B  holds  in  the  form 
statedf  for  functions  integrable  in  the  GXP  sense,  whilst  theorem  A  holds 
with  the  additional  hypothesis^  /?^A+1.  There  is  then  a  corresponding 
extension  of  theorem  C. 

Theorems  D  and  E  can  be  extended  hi  a  similar  way;  §  we  prove  the 
results  for  the  allied  series,  as  they  involve  extra  considerations  concerning 
the  integrability  of  i/f(t)  cot  \t. 

Finally,  we  deduce  a  result  concerning  the  existence  of 


J, 


t 

It  is  well  known  that  if  f(t)  is  absolutely  integrable,  then  (1-1)  exists  as  a 
Cauchy  integral  for  almost  all  values  of  x.  This  result  was  obtained  by 
Plessner,)!  who  later^J  extended  his  proof  to  show  that  if  /(/)  is  integrable 
in  the  <70P  or  special  Den  joy  sense,  then  (1-1)  exists  as  a  special  Den  joy 
integral  for  almost  all  values  of  x.  In  view  of  a  result  due  to  Marcinkiewicz 
and  Zygmund,**  it  appears  that  this  result  also  holds  for  the  (7AP  integral; 
iff(t)  is  integrable  in  the  CAP  sense,  then  (1-1)  exists  as  a  <7AP  integral  for 
almost  all  values  of  x. 

2.   Notation  and  preliminary  lemmas  involving  (CP)  integrals 
The  notation  corresponds  to  that  given  by  Bosanquetft  for  the  (CL) 
integral.  We  suppose  throughout  that  A  >  0  and  fi  «  max  (A—  1,  0). 

*  For  the  definition  of  the  <7A  P  integral,  see  J.  C.  Burkill  (9). 

•f  I  am  indebted  to  Dr  Bosanquet  for  the  suggestion  that  it  should  be  possible  to 
dispense  with  a  restriction  involving  order  of  integrability  in  the  extension  of  theorem 
B,  and  with  the  corresponding  restriction  f$~&  A  in  theorem  Ex.  Cf.  footnote  f  on  p.  333 
and  footnote  f  on  p.  334, 

%  It  is  shown  below  that  the  condition  ft  ^  A  4-  1  is  required. 

§  See  theorems  IV  and  V  below;  these  theorems  also  extend  theorems  Dx  and  Et. 

||  A.  Plessner  (14).  See  also  A.  S.  Besicovitch(l). 

j[  A.  Plessner  (15).  It  has  been  shown  by  Maroinkiewicz  (12,  65-68)  that  the  result 
does  not  hold  for  the  general  Denjoy  integral. 
**  J.  Marcinkiewicz  and  A.  Zygmund  (11,  6);  if  a  trigonometric  series 


^ 

is  summable  (C,k)  in  a  measurable  set  E,  where  k>  -  1,  then  S  (bncoanx-anainnx) 
is  summable  (<7,  k)  almost  everywhere  in  E.  n*al 

tt  L.  S.  Bosanquet  (6). 
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Iff(t)  is  integrable  <7AP  in  (a,  6),  we  write  f(t)  €  Cx  P  in  (a,  6),  and  denote 

rb 
the  value  of  the  integral  by  I  f(t)  dt,  or,  if  we  wish  to  emphasize  the  existence 

Ja>  /•& 

of  an  integral  of  order  A,  by  CAP     f(t)dt.  If  f(t)zCxP  in  (a,  6)  for  some 

J  a 

unspecified  A,  we  write  f(t)  e  (CP)  in  (a,  6). 

If  a>0  and/(0  and  |  d-t\ot-1f(t)€(GP)  in*  (c,d),  then  C«(/,c,d),  the 
ath  mean  of  /(#)  in  (c,  d),  is  definedf  by  the  equation 


The  mean  of  order  0,  CQ(f,  c,  d),  is  defined  to  be  equal  tof(d). 
If  a  ^  0,  p>  —  1  ,  and 


O,  we  write  J  f(x  +  h)~lh*>     (C,a) 

as  ^->0.  The  usual  related  notation  is  defined  in  the  obvious  way. 

If  Km  f(x  +  h)  =  /(or)  (O,  a),  /(£)  is  said  to  be  Ca  continuous  at  the  point  x. 

*-»-0 

In  most  of  this  paper  we  are  concerned  with  functions  integrable  (CP)  in 
an  interval  (0,o),  where  a>0.  If  a>0  and/(<)  and  (x-^^f^e^P)  in 
(0,  a;),  where  x  >  0,  we  write 


We  also  write  F0(x)  =  /(a;),  and  use  a  similar  notation  with  other  functions. 
Since  Fa(x)  =  xaGa(f)0Jx)/T(a+  1),  it  foUows  that  f(x)~kP   (C9a)  as 
a;  ->  +  0  is  equivalent  to 


as  a;  ->  +  0. 

We  now  state  some  well-known  properties  §  of  the  CXP  integral. 

*  Or  in  (d,c)  if  d<c. 

f  By  lemma  II  below,  the  value  of  Cx(ft  c,  d)  does  not  depend  on  the  order  of  in- 
tegrability  off(t).  In  the  definition  of  Ca  means  given  by  Burkill  (9,  220-221),  and 
hence  in  the  definitions  of  Ca  limits  and  Ca  continuity  given  previously,  it  was  assumed 
that  the  integrals  involved  were  of  order  max  (a  —  1,0);  in  view  of  lemma  V,  stated 
below,  this  restriction  is  no  longer  necessary. 

J  Th\iaf(x  +  h)~lh*(C9a)  as  fe->0  implies  that/(*)  and  |a?  +  fc-J|«-V(*)€(CP)  in 
(x,  x  +  h)  for  all  sufficiently  small  values  of  |  h  \  . 

§  For  lemma  I,  see  L.  S.  Bosanquet(S);  there  is,  of  course,  an  analogue  hi  which 

f 
f(t)  €  <7A  P  in  (a,  b  —  e),  and  I  /(«)  du  tends  to  a  limit  (C,  A)  as  t  ->  6  —  0.  For  lemmas  II 

J  a 

and  III,  see  J.  C.  Burkill  (9).  Lemma  IV  follows  by  induction  from  lemma  III  and  the 
definition  of  the  functions  Fr(t). 


334  W.  L.  C.  SARGENT  [16  Dec. 

LEMMA  I.  In  order  thatf(t)  e  OA  P  in  (a,  6),  it  is  necessary  and  sufficient  that 

Cb 
(i)/(J)€(7AP  in  (a  +  e,b)  whenever  0<e<6-a,  (ii)      f^dueC^P  in  (a,6) 

and  tends  to  a  limit  (C,  A)  as  £-»-  a  +  0.  The  value  of  the  limit  is  then  equal  to 


LEMMA  II.  //  A  <  A'  and  /(£)  €  <7A  P  in  (a,  6),  thenf(t)  e  <7A.  P  tn  (a,  6). 


LEMMA  III.  Iff(t)eC^P  in  (0,  a),  cm<#  */  gr(i)  and  its  successive  derivatives 
are  all  bounded  in  (0,  a),  thenf(t)  g(t)  e  <7AP  m  (0,  a),  araZ 


f  "/(')  ff  (*)  *  =  JW«)  9(a)  -C^P  f"-Pi 
Jo  Jo 


LEMMA  IV.  Iff(t)eCxP  in  (0,<x)  arw?  r  is  zero  or  a  positive  integer,  then 
Fr(t)  is  defined  for  Q^t^a,  and 

F    (t)  =  C. 

where  v  —  max  (A  —  r,  0). 

We  next  state  some  results*  which  will  be  given  in  a  subsequent  paper.  f 

LEMMA  V.  Iff(t)  €  (CP)  in  (0,  a),  then  F^t)  exists  for  almost  all  t  in  (0,  a), 
ifoL  >  0,  and  for  all  tin(Q,a),ifa^l.  Further,  ifF^a)  exists  and  a  >  ft  >  0,  then 

(2-1) 


LEMMA  VI.    //  (Ka<a',  p>  -1  and  f(t)~ltp(C,a)  as  £->  +  0,  then 


LEMMA  VII.  Ifa^Q,p>-l  and  f(t)  ~  It*  (<7,  a)  as  t  ->  -f-  0,  then 


w-j^i^'f 

as  2->-fO,   where  ft  =  max(<x  —  1,0).    Conversely,  if  f(t)e(CP)  in   (0,a), 


a>0,  and  if  FM)~ -(C.a—\)  as  ^->  +  0,  where  a^l,  p>  —  1,  then 

p  + 1  f 


*  For  similar  results  involving  (GL)  integrals,  see  L.  S.  Bosanquet  (6,  theorems  8 
and  9  and  lemma  1  )  .  For  the  proof  of  (  2-  1  )  in  the  case  when/(£)  and  (x  -  t)P-*f(t)  <-CyP 
in  (0,a;)  whenever  Q<x*£a,  where  y  =  max  (/?—  1,0),  see  J.  C.  Burkill  (9,  221). 

f  The  main  results  obtained  in  this  paper  are  contained  in  lemma  V,  and  arose  out 
of  a  suggestion  due  to  Dr  Bosanquet  (cf.  footnote  f  on  p.  332  and  footnote  f  on  p.  333). 
Lemmas  VI  and  VII  can  be  deduced  without  difficulty,  and  the  proof  of  lemma  VIII 
is  based  on  lemma  V  and  is  similar  to  one  already  given  for  the  CQP  integral  (cf. 
W.  L.  C.  Sargent  (17)  and  the  references  there  given). 
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LEMMA  VIII.  //  0  <  a  <  1,  0  <  a  <  6,  andf(t) e (GP)  in  (0, a),  then 


^ess-bound  |  Fa(x)  \ . 

0<x<a 

The  next  results  are  similar  to  results  obtained  by  Bosanquet*  for  the 
(GL)  integral. 


LEMMA  IX.  7/a^O,A'  =  max  (A,  a—  !),#>  —  l,q<p+  land(i)f(t)eC^P 
in  (0,a)  and  is  o(tp)  (G,  a)  as  £->-f  0,  (ii)  k(t)  is  defined  in  (0,a)  and  kr(t)  is 
in  (0,a),  for  r  =  0,1,2,...,  then  f(t)k(t)^G^P  in  (0,a)  and  is 

' 


The  proof  is  by  induction  with  regard  to  a.  It  is  based  on  the  lemmas 
already  stated,  and  follows  the  same  lines  as  that  of  a  similar  result  due  'to 
Bosanquet.  f  ' 

LEMMA  X.  If  n  is  a  positive  integer,  p>  —  1,  and  (i)  f(t)€(CP)  in  (Q,a) 
and  is  o(t?)  (G)  as  t~*  +  0,  (ii)  k(t)  is  defined  in  (0,a)  and  kr(t)  is  Otf-v-*-*)  in 
(0,  a),  for  r  =  0,  1  ,  2,  .  .  .  ,  then  in  order  thatf(t)  k(t)  €  (  GP)  in  (0,  a),  it  is  necessary 
and  sufficient  that  Fn(t)kn(t)z(GP)  in  (0,a).  Further,  if  f(t)k(t)e(CP)  in 
(0,  a),  then 


f 

J 


O  r«=l  Jo 

It  follows  from  lemma  III  that,  whenever  0  <  t  <  a, 


f  af(u)  k(u)  du  =  F  S  (  - 


4-  (  - 

Since  /(O  is  o(^)(O)  as  «->4-0,  it  follows  from  lemma  VII  that  Fr(t)  is 
o(t*>-*)(C)  as  J->  +  0,  r  being  a  positive  integer.  Since  Fr(t)z(CP)  in  (0,a), 
it  foUows  from  lemma  IX  that  Fr(t)  kr~*(t)  €  (CP)  in  (0,  a)  and  is  o(l)  (C)  as 

ra  fa 

t-+  +  Q.  Hencet  if  one  of  the  functions      f(u)k(u)du,      Fn(u)kn(u)du€.(CP) 

Jt  Jt 

in  (0,  a)  and  tends  to  a  limit  ((7)  as  t  -»•  -f  0,  then  so  does  the  other.  The  results 
stated  follow,  by  lemma  I. 

LEMMA  XI.  //§/(£)  e  (CP)  in  (0,  a),  then  in  order  that 

£),  (2-2) 


f 
J  t 


*  Cf.  L.  S.  Bosanquet  (6,  theorems  2,  18  and  19). 
t  L.  S.  Bosanquet  (6,  46-48). 

$  We  use  the  consistency  results  given  by  lemmas  H  and  VI.  These  results  are 
implied  by  the  notation,  and  we  shall  not  refer  to  the  lemmas  explicitly  every  time. 
§    Cf.  R.  E.  A.  C.  Paley  (13,  176-176)  and  L.  S.  Bosanquet  (6,  56)  for  similar  results. 
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where  $"&(),  it  is  necessary  and  sufficient  that 

0     (C9d+l),  (2-3) 


and  lim  du  =  5     (C).  (2-4) 

t-++o  J  t    u 

It  follows  from  lemma  III  that  t~lf(t)  €  (CP)  in  (?/,  a)  whenever  0  <  y  <a. 

Since  (2-4)  implies  that  I    -^—^  du  €  (CP)  in  some  interval  (0,  ?/),  if  >  0,  whilst, 

f0/^)  ^ 

by  lemma  I,       -^due(CP)  in  (??,«),  it  follows  from  lemma  I  that  (2*4) 

Jt    u 
is  equivalent  to 


^du  -  s.  (2-5) 

o    v> 

Since  (2-4)  is  included  in  (2-2),  we  assume  throughout  that  (2-5)  is  satis- 
fied; (2-2)  is  then  equivalent  to 


lim  du  =  Q     (C,S).  (2-6) 

t-++ojo   u 

We  therefore  have  to  show  that  (2-3)  follows  from  (2-6)  and  vice  versa. 

Write  g(t)  =  f  ^-du  for  0  ^  t^  a.  Then,  by  lemma  III, 
Jo    u 


If  (2-6)  is  satisfied,  it  follows  from  (2-7)  and  lemmas  VI,  VII  and  IX  that 
\  is  o(t)  (C,  d)  as  t-*  +  0,  and  hence,  by  lemma  VII,  (2-3)  is  satisfied. 

It  remains  to  show  that  (2-6)  follows  from  (2-3).  Since  g(t)  is  o(l)  (C)  as 
J-»4-  0,  by*  lemma  I,  we  may  suppose  that  g(t)  is  0(1)  (C,  8+n)  as  i 
where  n  is  a  positive  integer.  Since 

lf'' "  *       (0<*<a) 


by  (2-7),  it  follows  from  (2-3)  and  lemmas  VI,  VII  and  IX  that  g(t)  is  o(l) 
((7,  8  +  n—  l)as£->  +  0.  By  n  —  1  further  applications  of  the  argument  used 
above,  it  follows  that  (2*6)  is  satisfied. 


LEMMA  XII.  Iff(t)eC^Pin  (0,a),  then  r^dusC.Pin  (0,a). 

Jt    u  * 

Let  n  be  a  positive  integer  such  that  n  ^  max  (A  4-  1,  2).  Write 


*  Or  by  (2-4)  and  (2-5). 
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for  0  <  t^  a,  and  take  v  such  that  0  <  v  <  a.  Then,  by  lemma  III, 


If  r  is  a  positive  integer  greater  than  2,  it  follows  from  lemmas  II  and  IV 
that  Fr(t)eCvP  in  (0,a),  where  v  —  max(/*  —  1,0).  Also  Fn(t)  is  continuous 
for  0  ^  t  ^  a.  Further,  since  F^(t)  is  o(  1  )  (<7,  A)  as  t  ->•  +  0,  by  lemma  I,  it  follows 
from  lemmas  VI  and  VII  that  Fr(t)  is  ©(r-1)  (C,/0»  whilst  Fn(t)  is  op*-1) 


(C,  0),  as*-^+  0.  It  therefore  follows  from  lemma  IX  that  £  (r-  1)!  t^FXt;), 

r-2 

considered  as  a  function  of  v,  is  integrable  CVP  in  (0,a)  and  is  o(l)  (C,/i) 
as  i>->  +  0.   Further,  it  follows  from  the  properties  of  Fn(t)  stated  above 

ftt 

that  v\   t~(n+^Fn(t)dt  is  bounded  in  (0,a)  and  is  o(l)  as  v->4-0.   Hence 
Jt? 

Cp  P  \*0(t)  dt€.CvP  in  (0,  a)  and  tends  to  F^a)  (  C,  p)  as  v  ->  -f-  0.  It  therefore 

•/  V 

follows  from  lemma  I  that  0(t)  €  C^P  in  (0,  a). 

LEMMA  XIII.  //  a  ^  0,  /(<)  eC^P  in  (0,  a),  emd 

Hm/(i)  =  0     (O,a),  (2-8) 


^  =  s     (C),  (2-9) 

U 

then  t~*f(t)  €  C^P  in  (0,  a),  w^ere  A'  =  max  (A,  a  -  1),  and 

faf(u)J 

J-^-Ldu  =  s. 
Jo    u 

Since  /W  is  o(l)  (Cf,  A'+  1)  as  t->  +  0,  by  (2-8)  and  lemma  VI,  it  follows 
from  (2-9)  and  lemma  XI  that 


lim  du  =  s     ((7,  A'). 

t-^+ojt    u 

ra  f(u) 

Since   I    -  —  -dueC^P  in  (0,a),  where  pf  —  max  (A'—  1,0),  by  lemmas  II 

and  XII,  the  result  stated  follows  from  lemma  I. 

We  obtain  one  more  lemma*  which  involves  integrals  of  a  sequence  of 
functions. 

*  This  lemma  is  fundamental  in  the  proof  that  summability  (  C,  ft)  of  the  Fourier 
series  and  allied  series  of  a  function  integrable  C^P  is  a  local  property  if  ft^  A  +  1. 

SBB.  2.     VOL.  50.     NO.  2400.  W 
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f* 
LEMMA  XIV.    //  0<g<l  and  F(t)  =  CQP\  f(u)du  for  o<^<6,  where 

J  a 

f(t)  is  Cq-continuous for*  a^t^b,  and  if,  for  a^t^b, 

(»=1,2,3,...),  (2-10) 


kn(u)du 


(n=l,2,3,...)5  (2-11) 


f& 
where  A  is  constant,  then     lim  I   F(t)  kn(t)  dt  —  0. 

»->-ooJa 

If  q  ==  1,  the  result  stated  follows  from  Lebesgue's  convergence  theorem,  f 

We  therefore  suppose  that  0  <  q  <  I  .  For  convenience  we  shall  also  suppose 

that/(£)  vanishes  for  t  <  a  and  for  t  >  b.  We  proceed  to  show  that  every  point 

c  of  the  closed  interval  (a,  6)  is  contained  in  an  open  interval  (c  —  3,  c  +  #), 

such  that  rd 

lim      F(t)kn(t)dt  =  Q,  (2-12) 

n->oo  J  c 

whenever  c  —  d<d<c  +  $.  The  result  stated  follows  without  difficulty,  by 
means  of  Borel's  covering  theorem. 

Since  /(a?)  is  (^-continuous  at  the  point  c,  we  can  find  $  such  that  Cq(f,  c,  x) 
is  bounded  for  0  <  \  x  —  c,  \  ^  $.  We  shall  suppose  that  c  <d  <  c  -h  8\  the  case 
when  c  —  d<d<c  can  be  considered  in  a  similar  way.  We  shall  also  suppose 
that  F(c)  =  0;  this  is  clearly  legitimate,  in  view  of  (2-11). 


Write  g(x)  =  ~  j  \x  -  t)^f(t)  dt     (c<x<  d)} 


(c<x<d). 

Then  g(x)  is  bounded  in  (c,d),  and  it  can  be  shown  by  a  standard  methodf 
that  hn(x)  is  bounded  in  (c,  d).  Further,  if  c  ^  7  <d,  then,  by  lemma  V, 


*  It  is  understood  that/(<)  need  only  be  Cq  -continuous  on  the  right  at  a  and  on  the 
left  at  6. 

t  This  may  be  stated  as  follows  :  if  F(t)  e  L  in  (a,  6)  and  the  functions  Jcn(t)  are 

fA 
uniformly  bounded  in  (a,  6),  and  if  lim     I    kn(t)dt  —  0   whenever   a^A^6,  then 

/•&  n  ->-  oo  J  a 

lim 


/•& 
lim 
->  a>  J  a 


If  x<d—  1/n,  we  consider  the  integrals  in  (x,  x+lfn),  (x+l/n,d)  separately 
See  L.  S.  Bosanquet  (3,  196-197). 
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where  y  <  g  <  d,  so  that,  by  (2-11), 

Jd 
hn(t)dt^  0. 
y 

It  follows  from  lemma  V  that 

F(x}  =  T(f=T)  JT(a:  ~  ° 

for  c  <  a?  <dy  and  hence,  by  Fubini's  theorem, 
f*F(t)kn(t)dt  =  j^ 


It  therefore  follows  from  Lebesgue's  convergence  theorem  that  (2*12)  is 
satisfied.  The  lemma  is  thus  established. 


3.    Results  involving  allied  series 

We  suppose  throughout  this  section  that/(£)  is  periodic,  with  period  2n, 
and  that  /(t)  e  (7A  P  in  (0,  2n).  We  write 

0  -/(*-*)},     «n  +  *n  =  -  I2"f(t)enitdt, 

"JO 
n 

Bn(x)  =  6*  cos  9W  -  an  sin  raa,     5W  =  S  Bv(x),      t 

v«,o 

and  denote  the  nth  Cesaro  mean*  of  order  ft  (/?>—!)  of  sn  by  s%.  Then 

00 

2  Bn(x)  is  the  allied  series  off(t)  for  *  =  a?,  and 


(3-1) 

n-l 
1  C* 

=  -C^P\    &(t)  {cot  %t-  cos  (%  +  J)  icosec  JQdi,  (3-2) 

K  JO 


o 
where  A;n(^)  is  the  nth  Cesaro  mean  of  order  ft  of  cos  (n  +  J)  t  coseo  %t. 


*  I.e.  *J  =  -JTJ  S  A%^,av,  where  Arn  is  the  coefficient  of  xn  in  the  expansion  of 

-»ni"-  0 

(1  —a?)-^"-1  in  ascending  powers  of  a;;  it  is  clear,  of  course,  that  J30(#)  =  0  and  80  =  0. 

W2 
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If  fi^  0,  it  can  be  shown*  that 


Bnr+l(nt)~p  t^n  (3-5) 

where  r  is  zero  or  a  positive  integer,  p  —  min  (r  +  2,  ft  +  1),  and  .4  and  B  are 
constants,  which  may  depend  on  r  but  are  independent  of  n  and  £. 

If  p  >  0,  we  write  Cp(t),  Cp(t)  for  Young's  functions,  defined  by  the  equation 


It  is  well  known  that  Cp(t)  is  bounded  for  t^O,ifQ<p^2,  whilst  t2-*>Cp(t) 
is  bounded  for  t  >  0,  if  p  >  2.  Further,  if  m  is  an  odd  positive  integer,  thenf 

l(m-l)     (_iy£p+2r  _ 

O-  (3>6) 


We  next  give  three  more  lemmas. 

LEMMA  XV.  If  a>  ft  +1^0  and  TtUn  is  summable  (C,ft),  then 
lim      S0.(««)/>"  =  o(^) 


a«  «  ->  +  0,  an^  -j  lim     S 

t         />->!—  0  w=l  ^ 

in  (0,7r). 

Lemma  XV  has  been  obtained  by  Bosanquet  J  in  the  case  when  ft  ^  0, 
and  can  be  obtained  without  difficulty  when  §  —  1  ^  ft  <  0. 


*  (3-4)  is  easily  verified  since  cot  £2  —  cos  (n -f  £)  J  cosec  ££  =  2  S  sinrt,  whilst  (3-5) 

v=o 

follows  from  working  due  to  Zygmund  (20,  258-269). 
f  Easily  verified  by  integration  by  parts. 
J  L.  S.  Bosanquet  (7,  lemmas  7  and  8).  It  is  shown  in  lemma  8  that 


tends  to  a  finite  limit  as  $->  +  0,  and  hence  — —     lim      £   — ^J3a(nt)pne  C0P  in  any 
interval  (0, a),  where  a >0.  /?->i-On=ina  ^ 

§   Since  C?a(ni)  =  C7a+1(n«),  it  can  easily  be  verified  that  the  series  X  —  <?a(wO  is, 

in  this  case,  uniformly  convergent  for  t^O.  Since,  for  fixed  n,  Ca(nt)  is  O(fa+1)  as 
t  -*  +  0,  the  results  stated  follow  without  difficulty. 
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LEMMA  XVI.  In  order  that 

I  Cn 
lim  -  I    -df(u)  cot  \udu  =  8     (C,8), 

t-++o7Tjt 

where  d^O,  it  is  necessary  and  sufficient  that 

lim  1^(0  =  0     (C,$+l), 

t^+o 

i  r* 

and  lim  -      i/r(u)cot  \udu  =  a     (G). 


Write  g(t)  »  tfr(t)cot%t  for  0<t^n.  Then,  by  lemma  III,  g(t)eGAP  in 
(0,7r).  Also,  by  lemma  IX,  lim  ^($)  =  0  (C,  8+  1)  is  equivalent  to  lim  g(t)  =  0 


(C,S+1).  The  result  stated  therefore  follows  from  lemma  XI. 

LEMMA  XVII.  //  a  ^  0  and 

lim  ^(0  =  0     (C,a), 

«-*•+<> 

1  f7' 
lim  —  I    ^(w)cot|wc2^  =  s     (C), 


then  ijr(t)  cot  \t  e  (7A*  P  in  (0,  a),  where  A'  =  max  (A,  a  —  1), 


Define  g(t)  as  in  lemma  XVI.  Then  lim  g(t)  =  0  ((7,  a),  and  the  result 
stated  follows  from  lemma  XIII.  <~>+0 

We  now  obtain  the  main  theorems. 

THEOREM  I.  //  a  ^  0  and* 

lim  ^r(t)  =  0     (C,a),  (3-7) 


lim  --  f  nft(u)  cot  Jw^  =  s     (C),  (3-8) 

<-»-+o7rje 

00 

then  2  J3n(#)  is  summable  (  (7,  /?)  to  5,  w/iere 


*  If  a^  1,  it  follows  from  lemma  XVI  that  (3-7)  and  (3-8)  are  equivalent  to 
lim  - 

t^+oTT 

if  0<a<  1,  then  v>(<)  =  o(l)  ((7,  1)  as  *->  +  0,  by  (3-7)  and  lemma  VI,  and  hence  the 
limit  in  (3'  8)  exists  in  the  ordinary  sense. 
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We  first  point  out  that  the  condition*  ft  >  A  -f  1  is  required. 
there  is  afunctionf/(£)  integrable  in  the  <7AP  sense  in  (0,  2n),  and  continuous 
except  at  the  origin,  for  whichf  an  3=o(n#)  and  bn=£o(nf)  as  n-+oo.  Thus 
(3-7)  is  satisfied  with  a  —  0,  whenever  0  <  x  <  2n,  and  it  can  be  deduced 
from  the  result  mentioned  above  due  to  Plessnerf  that  (3*8)  is  satisfied  for 
almost  all  values  of  x  in  (0,  2n).  On  the  other  hand,  since  an^o(n^)  and 
bn  =£o(nP)  as  n->co,  TZBn(x)  can  only  be  summable  (C,fi)  is  a  set  of  measure 
zero. 

In  view  of  consistency  theorems  for  Cesaro  sums  and  Cesaro  limits,  it  is 
sufficient  to  consider  the  case  when  a  =  m+p,  /?  =  m  +  #,  where  m  is  zero  || 
or  a  positive  integer,  0  <  p  <  q  ^  I  and  m  +  q  ^  A  +  1  . 

Since  m  +  g  >  A  +  1  ,  it  follows  from  lemmas  II  and  IV  that 


o 

rt 
and,  in  the  case  when  m>0,  *¥m(t)  =  CqP\   ^Ym_^(u)du  for  Q^t^n.  Thus 

Jo 

x^m+i(0  *8  continuous  for  O^t^Tr  and,  if  m  >  0,  Tm(£)  is  Cq-continuous  for 
.  Further,  since  m  -f  1  >  a,  it  follows  from  (3-7)  and  lemma  VII  that, 

((7,0).  (3-9) 

It  follows  from  (3-7),  (3-8)  and  lemma  XVII  that  &(t)cot$te(CP)  in 

1    f* 

(0,77)    and   ~     i/r(t)cot%tdt~8.    It    therefore    follows    from    (3-3)    that 
71  Jo 

in  (0,7r),  and 


o 
Hence,  by  (3-4),  (3-7)  and  lemma  X,  xFw^1(0^+1(Oe(C'P)  in  (0,*r),  and 


r-l  Jo 

=  0(1)  +  (  -  1  )~  f  Vm+1(i)  *»+!(*)  dt, 

Jo 


t  by  (3-5). 


*  Cf.  S.  Pollard  (  16,  216)  for  a  similar  argument.  It  can  be  seen  from  the  arguments 
given  below  that,  if  /?;&  A  +  1»  summability  (C,  ft)  of  SJ3n(a;)  is  a  local  property. 

t  W.  L.  C.  Sargent  (18)  ;  for  a  similar  construction  in  the  case  A  =  0,  see  E.  C.  Titch- 
marsh(19). 

J  It  can  be  deduced  from  lemma  XIV  that  an  and  bn  are  o(n*+l)  as  n  ->  cx>. 

§  If  r  is  any  positive  integer,  then/(£)  is  absolutely  integrable  in  (  1/r,  2?r),  and  hence 
the  limit  in  (3-8)  exists  in  the  ordinary  sense  for  almost  all  x  in  (1/r,  2n). 

|J   m  can  be  zero  only  if  A  =  0  and  q  =  1. 
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Since,  by  (3-4),  (3-9)  and  the  continuity  of  Tm+1(J),  W^^t)  k%  +1(t)  e  O0  P 

fn 

in  (0,  n)  and  is  o(l)  (C,  0)  as  £-v  +  0,  whilst,  by  lemma  I,      ^m+1(u)  k^(u)  du 
tends  to  a  limit  (C)  as  £->  +  0,  it  follows  from  lemma  XIII  that 


in  (0,7r).  It  therefore  follows  from  (3-4)  and  the  continuity  of  XFW+1(^)  that 
QP  in  (0,7r),  and  hence,  by  (3-4)  and  (3-9), 


lim 

/o 

If  m>0,  it  follows  from  (3-5),  lemma  XIV  and  the  <7tf-continuity  of 
*Vm(t)  that,  whenever  0  <  ?/  <  TT, 

lim      ^m+^(t)fc^+I(t)dt  —  0, 

n— >oo  J  rf 

whilst,  if  m  =  0,  this  result  follows  from  Lebesgue's  convergence  theorem. 

In  order  to  prove  that  X  Bn(x)  is  summable  (<7,  ft)  to  s,  it  therefore  remains 
to  show  that 


lim  lim  If 

i/->0  «->»  I  J  1/n. 


:    0. 


i- 

The  proof  of  (3-10)  follows  standard  lines.  In  view  of  (3-7),  we  can  take 
77  so  that  *¥a(t)  is  bounded  in  (0,  ?/).  Since 


for  Q<t^7T,  by  lemma  V,  it  follows  from  Fubini's  theorem  that 

r  ^«H.iw 

Jl/» 


l/n 


It  follows  from  (3-5)  and  the  second  mean  value  theorem  that,  whenever 
^  '  (t  —  u)-p k™+l(t) dt  ^2Bnm+1(l/n  —  u)~p9 

Jl/n 

and  it  can  be  shown  by  a  standard  method*  that 


IJtt  n 


for  l/n  ^u^ij,  where  C  is  a  constant  independent  of  n  and  rj. 

Since  Ta(e)  is  o(ta)  as  «->  +  0,  by  (3-7),  it  follows  without  difficulty  that 
(3-10)  is  satisfied. 

*  Cf.  L.  S.  Bosanquet  (3,  196-197). 


lim  -  f 

t-++o7Tji 
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QO 

THEOREM  II.  // a > /?+ 1  ^  0  and  £  Bn(x)  is  summable  (C,  ft)  to  s,  then 

n-l 

\u  =  8     (C).  (3-8) 

By  increasing  ft  if  necessary,  it  is  sufficient  to  consider  the  case  when 
/?+ 1  «x  <  [/?]  +  2.  Then  a  =  ra -f-#,  where  m  =  [/?]  +  1  and  0 <p ^  1.  We 
shall  suppose  that  m  is  odd;  if  m  is  even  (or  zero)  the  procedure  is  similar. 

It  can  be  deduced*  from  (3-1)  that  there  is  a  function  G(t)  which  differs 
from  *¥m(t)  by  a  polynomial  of  degree  m  —  1  and  is  such  thatf 

G(t)  ~  ( -  1  )*<»M-I>  £  _J!J  cos  Tie,     G±(t)  ~  ( - 1  )Km+i>  £  -^  sin  w«, 

where  Bn  =  -Bn(a;). 

Since  the  (CP)  integral  XFOT+1(0  is  (^-continuous,  so  is  G±(t).  It  therefore 
follows  from  theorem^  D19  due  to  Burkill,  that  the  Fourier  series  of  G^t) 
is  summable  (C)  to  G^t)  for  all  t.  Since  J5W  is  o(nP)  as  rz.^-oo,  and  m>  ft,  this 
series  is  uniformly  convergent  for  any  range  of  values  of  t,  and 


Take  ^,  t  such  that  0<^<^;r.  Then,  by  lemma  III  and  the  uniform 
convergence  of  the  series  on  the  right, 


r(t-v] 

=  («-  a)*-1  ^(M)  -(1-^)  r^- 

Jo 


x  j  (£  —  w)^-1  sin  nu  —  n\    (t  —  v)p~l  cos  Tivrfv  J 

D  £  ^  P(«-t7)P-l 


*  If/(*)~i«o+  S  (ancosn*  +  6nsinn«),  where  /(<)  has  period  2?r  and/(0€(<7P)  in 

n=l 
(0,  27r),  integration  by  parts  gives 

a        1   f2fll  ft  1    r2" 

"  =  -         {^(O-K^sinn^^,     ^  =  _i         {JT^-la^coenKfc. 

n       irjo  n          TrJo 

f  ^(t)  is  defined  as    I    G(u)du,  in  the  usual  way,  and  hence  (^(O  differs  from 

J  0 
^m+i(*)  by  a  polynomial  of  degree  wi.  $  Stated  in  the  Introduction. 
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If  0  <  u  ^  t,  then,  by  lemma  VIII, 

(t  —  v)p-1cosnvdv  Abound       (y  —  v)p~1cosnvdv 
IJo  o<v«  |Jo 

=  bound  |  T(p)n-»  CP(ny)  \ 


where  K  is  constant.  Since  Bn  is  o(n0)  &sn->ao,  and  ra+jp  —  /#>l,it  follows 

ru 
that       (t  -  v)p~l  G(v)  dv  is  bounded  in  (0,  t)  and  tends  to 

Jo 

nCp(nt) 


as  M->£  -  0.  Hence,  by  lemma  I  and  (3-6),  (t-v)?-1  Q(v)  €  (CP)  in  (0,  t),  and 


i<»»-l)(— 
V 


_ 
(  _  ]  U(m+l)  Q         ( 

(     L)          °-+p 

Since  ^n2r~mBn  is  summable*   (G)  and  therefore  summable   (Jl)  for 
r  =  0,  1,  2,  ...,  i(m-  1),  it  follows  that 


where        cr  =    lim    (  £  (-  l)warHn+1>»2r-wlJ5w/on)/ 

/>-^l-0  ln-1  )/ 

and  hence| 


where  P(£  )  is  a  polynomial  in  t  of  degree  m  —  1  . 
It  follows  from  (3-11)  and  lemma  XV  that,  as  £- 


Since,  however,  T1(^)  is  o(  1  )  (  C)  as  £  ->  -f-  0,  by  lemma  I,  it  follows  from  lemma 
VII  that$  xYm(t)  is  o(tm~*)  (C)  as  t->  +  0.  In  view  of  lemma  VI,  the  poly- 
nomial P(t)  must  therefore  be  identically  zero,  and  hence  ^¥m(t)  is  o(tm)  (C,p) 
as  ^->+  0.  It  follows  from  lemma  VII  that  (3-7)  is  satisfied. 

*  Cf.  L.  S.  Bosanquet  (4,  277). 

t  (3-11)  can  be  established  whether  m  is  even  or  odd,  P(t)  being  zero  if  m  is  zero. 

J  If  m  —  0,  this  argument  is  not  required. 
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Since  P(t)  is  identically  zero,  it  follows  from  (3-11)  and  lemma  XV  that 

rt 
t-<m+P+vl  (t-u)»-llk¥m(u)du€CQP  in  (0,77).  This  result  holds  whenever 

Jo 

m  +  l^m+p>fi+l,  and  hence,  taking^  =  1,  ^m+2)x*Vhi(£)€<70P  in  (0,7r). 
Since  (3-7)  has  already  been  established,  it  follows  from  lemma  X  that 
t~lifr(t)  e  (GP)  in  (0,  TT)  ;  (3-8)  then  follows  from  lemmas  III  and  I  and  theorem  I. 

GO 

THEOREM  III.  In  order  that  2  Bn(x)  oe  summable  (C)  to  s,  it  is  necessary 

n=l 

and  sufficient  that  ft(t)  cot  \t  e  (CP)  in  (0,  TT),  and 


00 

By  theorems  I  and  II  and  lemma  XVI,  in  order  that  S  B<n(x)  ^e  sum- 
mable  (C)  to  s,  it  is  necessary  and  sufficient  that  waal 


lim  -I   ^f(u)cot^udu  =  s     (C), 

t->  +  Q7TJt 

and  this  is  equivalent*  to  the  condition  given. 

4.    Results  involving  Fourier  series 

The  following  theorems  can  be  obtained  by  methods  similar  to  those  used 
for  the  corresponding  theorems  on  allied  series.  In  stating  the  theorems 
we  suppose,  as  before,  that/(Z)  has  period  2n  and  is  integrable  (7AP  in  (0,  2n). 
We  write  <f>(t)  =  %{f(x-i-t)+f(x  —  t)},  and  denote  the  Fourier  series  of  f(t) 

for  t  =  x  by  £  ^»(«). 


THEOREM  IV.  //  a^  0  and  lim  0(£)  —  s  ((7,  a),  £^e/&  S  An(x)  is  summable 

t-++Q  n«0 

(C,fi)  tos,  where  /? 


THEOREM  V.   //  a>/?+1^0  awd  2^w(^)  *5  summable  (C,fi)  to  s,  then 
Iim6(t)  =s(C,a). 

t^  +  Q 

oo 

THEOREM  VI.  In  order  that  2  ^n(^)  oe  summable  (C)  to  s}  it  is  necessary 

n=0 

and  sufficient  that  lim  <fi(t)  =  s  (C). 


*  By  lemma  I;  cf.  the  argument  at  the  beginning  of  lemma  XL 
f  By  arguments  similar  to  those  used  in  theorem  I,  the  condition 
required. 
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It  should  be  observed  that  theorem  IV  extends  theorem  Dx  to  the  case 
a <  A+  1,  as  well  as  to  non-integral  values  of  A;  theorem  V  extends  theorem 
Ex  to  the  case  —  1  </?<  A.  Theorem  VI  follows  from  theorems  Dj  and  Ex  or 
from  theorems  IV  and  V. 

5.    Existence  of  (CP)  (*f(x  + t}  ~~f(x ~  *} dt 
Jo  t 

THEOREM  VII.  Iff(t)  e  <7A  P  in  ( -  TT,  2n),  then 


i: 


t 


exists  as  a  CAP  integral  for  almost  all  x  in  (0,  TT). 

Since  {f(x  +  t)—f(x  —  t)}/teC^P  in  (J7r,7r),  by  lemma  III,  we  may  alter 
the  values  of  /(£)  outside  the  interval  (  —  JTT,  fyr),  so  that  it  has  period  27T. 

Since  F^(x)  =  CxP\f(t)dt,  f(x)   is  the  OA  derivative  of  F,(x)  almost 
Jo 

everywhere,*  and  hence 

lim  /(*  +  *)=/(*)     (C,A+1)  (5-1) 

t->+o 

for  almost  all  values  of  x.  If  we  define  <f>(t)  and  i/r(t)  as  before,  it  follows  that, 
for  almost  all  values  of  x, 


lim  <f>(t)  =  f(x)     (^A+l),  (5-2) 

t->+Q 

and  lim  jfr(t)  =  0     ((7,A+1).  (5-3) 


It  follows  from  (5-2)  and  theoremf  IV  that,  for  almost  all  values  of  #,  the 
Fourier  series  off(t)  is  summable  (C)  to  f(x)  for  t  —  x.  By  a  result  due  to 
Marcinkiewicz  and  Zygmund,}  the  allied  series  of  f(t)  is  summable  (C) 
almost  everywhere.  Hence,  by  theorem  II,  (5-3)  and  lemma  XVII, 


in  (0,  TT)  for  almost  all  values  of  #.  The  result  stated  follows  from  lemma  III. 

Finally,  I  should  like  to  thank  Dr  Bosanquet  for  many  valuable  criticisms 
and  suggestions. 

*  I.e.    f(x)  =  lim  {{.C^F^x,  x  +  h}-  F^(x)]l[hl(\  +1)]}.     This    is     equivalent     to 
fx+h  ^° 

/(*)  dt  ~  hf(x)  (  C,  A)  aa  h  ->  0,  and  hence,  by  lemma  VII,  it  is  equivalent  to  (5-1). 
J  x 

•\  Or  from  theorem  Dj. 

|  J.  Marcinkiewicz  and  A.  Zygmund  (11,  6  and  26-30). 
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1.    Introduction 

The  bulk  of  the  traditional  work  on  invariant  theory  is  concerned  with 
the  full  linear  group.  Concomitants  under  the  orthogonal  groups  have  been 
studied  f  by  adjoining  a  fundamental  quadric,  and  seeking  the  simultaneous 
concomitants  of  the  ground  forms  and  the  quadric.  This  procedure  adds 
considerably  to  the  complexity  of  the  problems.  An  alternative  method  of 
considering  the  invariant  theory  of  orthogonal  groups,  among  other 
restricted  groups,  was  considered  hi  a  previous  paper,  J  to  which  frequent 
reference  will  be  made.  It  will  be  referred  to  by  the  shortened  title  of 
"Restricted  Groups".  The  method  of  characteristic  analysis  employed 
involved  the  expression  of  the  characters  of  the  orthogonal  group  in  terms 
of  /S-functions,  and  the  calculation  was  in  terms  of  ^-functions.  This  con- 
version to  ^-functions  involved  considerable  complexity  as  compared  with 
the  case  of  the  full  linear  group. 

The  invariant  theory  of  spinors§  was  discussed  in  another  paper  and  the 
concomitants  found  of  a  basic  spinor  in  3-8  variables. 

All  methods  seemed  over-elaborate  in  comparison  with  the  simplicity  of 
the  results  obtained,  and  search  was  made  for  a  simpler  procedure.  This 
search  has  brought  some  success  in  the  case  of  orthogonal  groups  in  a  small 
number  of  variables,  more  especially  in  3  and  4  variables. 

The  fundamental  principle  employed  in  this  paper  is  the  comparison  of 
an  orthogonal  space  with  an  auxiliary  space  whose  coordinates  are  the 
components  of  a  basic  spinor,  or,  in  the  case  of  the  4-dimensional  rotation 
group,  of  two  conjugate  basic  spinors.  There  is  a  complete  correspondence, 
usually  two-one,  between  these  spaces,  and  to  each  algebraic  form,  tensor 

f  Turnbull(14). 
j  Littlewood  (8,  9). 
§   Littlewood(lO). 
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or  spinor,  in  the  orthogonal  space,  there  corresponds  a  form  or  tensor  in 
the  auxiliary  space.  The  invariant  theories  of  the  two  spaces  are  in  complete 
correspondence,  and  a  problem  in  the  orthogonal  space  may  be  translated 
into  a  problem  in  the  auxiliary  space. 

Hence  the  invariant  theory  of  ternary  orthogonal  forms  is  identical  with 
the  invariant  theory  of  binary  forms  under  the  full  linear  group.  The  group 
of  the  spin  transformations  is,  strictly,  the  unitary  group,  but  invariant 
theory  is  identical  with  that  under  the  full  linear  group  save  that  the 
determinant  of  the  transformation  is  unity. 

The  rotation  group  in  4  variables  is  similarly  found  to  correspond  to  the 
double  binary  group.  Concomitants  under  the  full  orthogonal  group,  that 
is,  the  group  including  transformations  of  negative  determinant,  are  easily 
deduced  from  a  knowledge  of  concomitants  under  the  rotation  group. 

In  a  like  manner  the  orthogonal  group  in  5  variables  corresponds  to  the 
symplectic  group  in  4  variables;  the  rotation  group  in  6  variables  to  the  full 
linear  group  (unitary  group)  in  4  variables.  The  last  correspondence  has 
been  noticed  by  Turnbull  and  employed  to  find  the  concomitants  of  a 
quadratic  complex  in  4  variables,  though  all  the  possibilities  of  the  corre- 
spondence have  not  yet  been  exploited. 

In  8  variables  the  rotation  group  possesses  an  automorphism  in  which 
a  basic  spinor,  or  rather  one  of  the  conjugate  components  of  a  basic  spinor, 
corresponds  to  a  simple  vector.  In  this  correspondence  the  orthogonal 
group  hi  7  variables  corresponds  to  a  sub-group  of  the  8 -variable  rotation 
group  in  which  this  component  of  the  basic  spinor  is  kept  invariant. 

2.   Note  on  two  results  which  led  to  the  central  theory 

A  very  short  account  is  given  first  of  two  significant  results  which  led  to 
the  more  general  theory  on  which  the  paper  is  based.  Since  the  two  results 
are  consequences  of  the  more  general  theory  discussed  later,  it  is  not  thought 
worth  while  to  give  more  than  the  most  brief  description  here. 

A  method  of  studying  the  invariant-theory  of  a  group  which  is  a  direct 
product  was  mentioned  hi  "Restricted  Groups".  It  is  known  that  every 
4-dimensional  rotation  is  expressible  as  the  resultant  of  a  right-handed  and 
a  left-handed  rotation  which  mutually  commute,  f  The  rotation  group  is 
not  quite  the  direct  product  of  the  two  groups  of  right-handed  and 
left-handed  rotations  respectively,  since  the  rotation  J  with  matrix 
diag  (  —  1  — 1  — 1  — 1)  belongs  to  both  groups.  But  the  rotation  J  commutes 
with  every  rotation,  and  generates  a  self -con  jugate  sub-group  of  order  2, 
the  quotient  group  corresponding  to  which  is  a  direct  product.  In  fact,  it 

t   L-ittlewood  (3L 
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is  the  direct  product  of  two  groups,  each  simply  isomorphic  with  the 
ternary  rotation  group.  f 

Hence  the  group  of  rotations  in  4  variables  is  in  2  :  1  correspondence 
with  the  double  ternary  rotation  group.  The  two  -valued  nature  of  this 
correspondence  is  in  no  way  embarrassing,  and  fits  in  perfectly  with  the 
two-valued  nature  of  the  spin  representations.  Invariant  theory  under  the 
quaternary  rotation  group  proves  to  be  identical  with  the  invariant  theory 
of  the  double  ternary  rotation  group.  Results  for  this  group  are  readily 
deducible  by  the  methods  given  in  "Restricted  Groups  "  from  the  invariant- 
theory  of  the  simple  ternary  rotation  group. 

To  make  the  method  really  effective,  a  simple  technique  for  the  ternary 
rotation  group  was  thought  to  be  necessary. 

It  was  remarked  that  the  latent  roots  of  a  ternary  orthogonal  matrix 

were  of  the  form  ,        id       _^& 

i,     6   ,     e     , 

and  could  thus  be  arranged  to  be  in  geometric  progression.  Further,  the 
character  of  any  irreducible  representation  was  expressible  as 

\n\  =  l  +  2cos#  +  2cos20-f  ...  +  2costt# 

l-n)id  4.  ...  4-  e~iO  4.  1  _f  ei&  +  .  .  .  +  eni°, 


and  the  latent  roots  again  are  in  geometric  progression. 

In  a  paper  by  Little  wood  and  Richardson  J  a  formula  was  given  for  the 
8-f  unctions  of  a  set  of  quantities  in  geometric  progression.  From  this  may 
be  obtained  very  easily  the  following  generating  function  for  concomitants 
under  the  ternary  orthogonal  group: 

V  [_n]®{p}  =  XKr[r], 

then  Kp  is  the  coefficient  of  p~r  in  the  expansion  of 

-*»+l-2n+*-tn+J> 

^         ' 


This  generating  function,  which  holds  also  for  spinors  if  n  is  half  an  odd 
integer,  is  very  similar  to  a  generating  function  obtained  for  binary  con- 
comitants^ Closer  examination  showed  that  the  differences  were  only 
differences  of  convention,  and  that  the  generating  functions  were  essentially 
identical.  Thus  the  following  theorem  came  to  light: 

The  concomitants  of  a  form  of  type  [ri]  under  the  ternary  orthogonal  group 
are  in  complete  correspondence  with  the  concomitants  of  a  form  of  type  {2n} 
under  the  binary  full  linear  group. 

f  Littlewood  (3). 

i  Littlewood  and  Richardson  (11). 
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In  the  correspondence  a  spinor  of  type  [n  +  J]  corresponds  to  a  form  of 
type  {2n  +  1}.  The  correspondence  also  extends  to  the  simultaneous  con- 
comitants of  several  ground-forms. 

In  particular,  a  basic  spinor  of  type  [J],  which  has  two  components, 
corresponds  to  the  binary  vector  of  type  {!}.  The  identification  of  the  binary 
variables  with  the  two  components  of  a  basic  spinor  led  immediately  to 
a  direct  proof  of  the  correspondence. 

The  case  of  the  4-variable  rotation  group  then  appeared  as  an  obvious 
extension  of  the  same  principle,  and  further  extensions  to  5,  6  and  8  variables 
followed. 

The  principal  theory  will  now  be  developed  with  more  careful  detail. 

3.    The  ternary  orthogonal  group 

There  is  a  basic  spin  representation  of  degree  2  of  the  ternary  orthogonal 
group.  The  second  induced  matrix  of  this  spin  transformation  matrix  is 
equivalent  to*  the  original  orthogonal  matrix.  Denote  a  character  of  the 
orthogonal  group  by  [n],  and  a  binary  /S-function  by  {m}.  Then  the  corre- 
sponding equation  is 


Hence  if  (£,  tf)  are  the  components  of  a  basic  spinor,  the  coefficients  of 
the  quadratic  form  «#  +  &,•  +  «,  (3-1) 


transform  hi  an  equivalent  manner  to  the  coefficients  in  the  ternary  ortho- 
gonal form.  The  variables  in  the  ternary  form  may  be  taken  so  that  the 
transforming  matrix  is  the  same  as  for  the  quadratic  form  (3-1).  Then  if 
the  ternary  form  is  ax  +  by  +  hz,  (3-2) 


the  coefficients  in  the  two  forms  will  remain  the  same  under  all  transforma- 
tions. 

The  invariant  quadratic,  or  metric,  associated  with  the  orthogonal  form 
must  then  clearly  be  _  o  /Q  o\ 

xy      z  ,  \o  o) 

since  £V*  —  (f??)2 

is  identically  zero  and  thus  invariant. 

A  complex  transformation  would,  of  course,  transform  (3-3)  into  the 
more  familiar  metric  x*  +  yz  +  z2  (3-4) 

if  this  special  form  were  required. 

An  algebraic  form  of  degree  2n  in  £,  tj  may  thus  be  associated  with  an 
algebraic  form  of  degree  n  in  x,  y,  z,  the  coefficients  being  the  same  in  the 
two  forms.  Further,  since  xy  may  be  replaced  by  z2,  the  linear  invariants 
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of  the  n-ic  in  x,  y,  2,  i.e.  those  terms  which  involve  the  factor  (xy—z2),  do 
not  appear.  Thus  the  principal  part  of  the  n-ic  only  is  involved,  and  it  is 
an  algebraic  form  of  type  [ri\.  To  every  form  of  type  {2n}  in  £,  TJ  there  is 
a  form  of  type  [»]  in  x,  y,  z  which  has  the  same  coefficients. 

The  invariant  theory  of  the  coefficients  is  thus  identical  for  the  binary 
2n-ic  and  for  the  ternary  orthogonal  n-ic. 

The  spinors  fit  completely  into  the  theory,  for  a  spinor  of  type  [n  +  J] 
could  be  regarded  as  the  coefficients  of  a  polynomial  of  degree  n  in  the 
variables,  and  of  degree  1  in  £,  77.  Such  a  form  is  in  obvious  correspondence 
with  a  form  of  degree  (2n  4-  1)  in  g,  17. 

It  may  be  remarked  here  that  the  method  is  already  well  known  of  using 
the  various  terms  in  an  algebraic  form  as  new  variables,  so  as  to  associate 
the  form  with  another  algebraic  form  of  a  different  degree  in  a  different 
number  of  variables.  Thus  with  two  variables  #,  y,  if  the  powers  and  pro- 
ducts of  degree  2,  namely,  x2,  xy,  yz,  are  treated  as  new  variables,  the  binary 
quartic  can  be  associated  with  a  ternary  quadratic.  Such  a  treatment  is 
the  basis  of  Sylvester's  "unravelment",t  and  recently  EdgeJ  has  further 
discussed  the  procedure. 

Sylvester  points  out  that  a  concomitant  of  the  form  in  the  new  variables 
necessarily  gives  a  concomitant  in  the  original  variables.  The  identity  of 
the  systems  of  irreducible  concomitants  in  the  two  sets  of  variables  does  not, 
however,  follow  from  this.  In  general,  the  group  of  transformations  in- 
duced in  the  new  variables  is  some  sub-group  of  the  full  linear  group,  and 
Sylvester's  result  concerning  the  concomitants  depends  on  the  fact  that 
the  concomitants  under  one  group  are  necessarily  concomitants  under  a 
sub-group.  The  identity  of  the  systems  of  irreducible  concomitants,  however, 
depends  on  the  complete  isomorphism  between  the  groups.  It  is  such 
isomorphisms  which  in  the  various  instances  are  established  here.  The 
isomorphism  is  not,  in  any  of  the  instances,  complete,  without  the  intro- 
duction of  spinors.  These  were,  of  course,  unknown  to  Sylvester. 

Examples.  The  concomitants  up  to  degree  5  in  the  coefficients  were  given 
in  "Restricted  Groups"  for  a  ternary  orthogonal  cubic,  the  types  being 
as  follows:  Degree  1;  [3]. 

Degree  2;     [4],  [2],  [0]. 
Degree3;     [6],  [4],  [3],  [1]. 
Degree  4;     [5],  [3],  [2],  [0]. 
Degree  5;     [4],  [2],  [1]. 


t  Sylvester  (12),  vol.  1,  284. 
J  Edge(l). 
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The  generating  function  (2-1)  for  concomitants  of  degree  6  gives 

,_18  (1  ~/>7)  (1  -P8)  (1  -P9)  (1  ~P10)  (1  -P11)  (1  -P12) 
P  (I  -  p2)  (1  -p3)  (1  -p4)  (1  -p5)  (I  -ft) 

=  p~18  +p~16  +p~15  +  2p~14  +  2p~18  +  4p~12  +  2p~n  -f  5p-10  +  4p~ 
+  6p~8  +  4p~7  +  7p~6  +  3p~5  +  6p~4  +  3p~3  -F  4p~2  +  3-3/9 

-  4p3-  3/94-  6p5-  3/)6-  7p7-  4/08-  6/)9~  4p10-  6p"-  2p12~ 

-  2pu  -  2p15  -  p16  -  p17  -  p19. 

Thus        [3]  ®  {6}  »  [18]  +  [16]  -f  [15]  +  2[14]  +  2[13]  +  4[12]  +  2[11] 
+  5[10]  +  4[9]  +  6[8]  +  4[7]  +  7[6]  +  3[5]  +  6[4] 


Deleting  the  reducible  concomitants,  we  obtain  for  the  types  of  irre- 
dudbles:  Degree  6;     [3],  [3],  [0]. 


Order  for 

sextic 

0 

2 

4 

6 

8 

10 

12 

Order  for 

cubic 

0 

1 

2 

3 

4 

5 

6 

Degree 

1 

— 

— 

— 

f 

— 

— 

— 

2 

(//)e 

— 

(//)*  =  i 

— 

(//)*  =  # 

— 

— 

3 

— 

(/  i)*  =  I 

— 

(/  *")* 

(fi) 

— 

(fH)  =  t 

4 

(^)4 

— 

(/Z)2 

(fl) 

— 

(Hi) 

— 

5 

— 

(»*)• 

(*  0 

— 

(HI) 

— 

— 

6        | 

(H? 

— 

— 

((fi}*l) 

— 

— 

— 

7 



(/P)4 

(/Z*)3 

(    „  )2 







8 

— 

(i  £2)3 



— 

— 

— 

— 

9 

— 

— 

((fi)e^ 

— 

— 

— 

— 

10 

(fl3)* 

(/  Z3)* 

— 

— 

— 

— 

— 

12 

— 

((fi)  J3)6 

— 

— 

— 

— 

— 

15 

((fi)l*)* 

— 

__ 

— 

— 

— 

— 

There  is  no  need,  however,  to  proceed  with  this  method.  Grace  and 
Youngf  give  the  complete  system  of  26  irreducible  concomitants  for  a 
binary  sextic.  The  table  is  reproduced  above.  By  reason  of  the  correspond- 
ence between  the  ternary  orthogonal  group  and  the  binary  full  linear  group, 
the  same  table  gives  the  complete  set  of  26  concomitants  of  a  ternary 
orthogonal  cubic.  The  orders  of  the  forms,  instead  of  reading  respectively 
0, 2, 4, 6, 8, 10, 12,  should  then  read  0, 1 , 2, 3, 4, 5, 6.  The  symbol  (/,  h)*,  instead 


t  Grace  and  Young  (2). 
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of  denoting  the  ith  transvectant  of  the  product  fh,  would  then  denote  the 
tensor  obtained  by  contracting  the  tensor  product  fh  with  the  metric  tensor 
t  times.  Thus,  if  the  coefficients  of  the  ternary  cubic  are/^fc,  and  g^  denotes 
the  metric  tensor,  the  (//)6  denotes  the  contraction 


In  a  similar  manner  it  follows  that,  since  a  binary  cubic  has  4  irreducible 
concomitants  of  the  following  types: 

Degree  1;  {3}, 

Degree  2;  {2}, 

Degree  3;  {3}, 

Degree  4;  {0}, 

then  a  ternary  spinor  of  type  [f]  has  4  irreducibles  of  types: 

Degree  1;  [f], 

Degree  2;  [1], 

Degree  3;  [f], 

Degree  4;  [0]. 

The  irreducible  systems  for  the  ternary  orthogonal  group  may  be  taken 
as  known  from  the  known  results  for  binary  forms,  for  the  following  types  :f 

[«,  [1],  [f],  [2],  [f],  [3],  ffl,  [4]. 
Simultaneous  systems  are  known  for  [f  ]  and  [f  ],  [1]  and  [f  ].  [1]  and  [2]. 

4.    The  quaternary  orthogonal  group 

Two  groups  are  called  orthogonal  in  4  variables,  that  which  comprises 
the  transformations  of  positive  determinant  only,  and  that  which  includes 
also  transformations  of  negative  determinant.  The  former  will  be  called 
the  rotation  group  irrespective  of  whether  the  metric  is  the  sum  of  the 
squares,  as  is  usually  understood  in  a  rotation,  or  any  other  non-singular 
quadratic  form;  the  latter  the  full  orthogonal  group. 

The  basic  spinor  for  the  full  orthogonal  group  is  of  degree  4;  for  the 
rotation  group  the  spinor  separates  into  two  conjugate  parts  each  of  degree  2. 
This  makes  possible  a  considerable  simplification  for  the  rotation  group, 
and  this  group  will  be  mainly  considered  here.  Invariants  under  the  full 
orthogonal  group  can  be  readily  deduced  from  a  knowledge  of  the  invariants 
under  the  rotation  group. 

t  Turnbull(14),  246. 
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The  character  [A19  A2]  of  the  full  orthogonal  group  separates  into  two 
conjugate  characters  for  the  rotation  group  if  A24=0.  It  is  convenient  to 
denote  these  by  [A1}A2]  and  [Al7  —  A2],  The  distinction  between  the  two  is 
conventional.  Using  a  dash  to  denote  a  character  of  the  full  orthogonal 
group,  this  separation  can  be  expressed  by  the  equation 


i,  -AJ. 
In  particular  [1  1]'  =  [1  1]  +  [1  -  1]. 

Thus  a  6  -vector  under  the  full  orthogonal  group  separates  into  two  parts 
with  3  components  each,  under  the  rotation  group.  The  6  independent 
rotations  in  4  dimensions  thus  separate  into  two  sets  of  3  which  are  called 
respectively  right-handed  and  left-handed  rotations.  A  right-handed  rotation 
consists  of  a  rotation  in  one  plane  combined  with  an  equal  rotation  about 
this  plane  as  axis;  for  a  left-handed  rotation  the  direction  of  the  rotation 
about  the  plane  as  axis  is  reversed. 

If  the  latent  roots  of  the  orthogonal  matrix  are 


then  the  basic  spin  character  is  given  by 
tl  iJ'  —  ^  cos  ia  cos  \P 


For  the  rotation  group  this  separates  as  follows, 


The  signs  attached  to  a  and  /?  are  conventional,  as  is  the  distinction 
between  [J  J]  and  [J  —  |].  A  convention  is  adopted  concerning  the  relative 
signs  of  a  and  ft  which  ensures  that  a  =  ft  for  a  right-handed  rotation  and 
a  =  —  /?  for  a  left-handed  rotation. 

It  follows  that  for  a  right-handed  rotation  [J  —  £]  =  2,  and  the  corre- 
sponding spinor  is  transformed  by  the  identical  matrix.  Similarly  for  a 
left-handed  rotation  [£  £]  =  2. 

Thus,  if  any  rotation  is  expressed  as  the  resultant  of  a  right-handed  and 
a  left-handed  rotation,  the  spin  matrix  of  character  [J  £]  depends  only  on 
the  right-handed  part,  and  the  spin  matrix  of  character  [  £  —  J]  on  the 
left-handed  part. 

The  two  conjugate  parts  of  the  basic  spinor  are  thus  algebraically  in- 
dependent. The  simultaneous  concomitants  of  two  spinors  of  types  [J  J] 
and  [^  —  J]  respectively  are  next  considered. 
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Since 
it  follows  that 


[|  £]  ®  {2}  =  1  +  e**+ 

=  1-f  2cos(a4-)tf). 


Thus     [ 

=  2  +  4  cos  a  cos/? 
-  [1  1]' 

=  [!!]  +  [1-1]. 
The  identification  [-J  J]  ®  {2}  =  [1  1], 


is  inevitable. 
Similarly, 


=  4  cos  J^>a  cos  \y>p  4-  4  cos  J(^>  —  2)  a  cos  J(^p  —  2)  /?+... 


Hence 

Lastly 

»p,  to]  Ete,  - 


,  \p\, 


This  is  easily  shown  to  be  expressible  as 


cos  a, 
cos/?, 


which  is  a  recognized  form  for 


Thence  it  may  be  inferred  that 

[  J 
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Thus  to  each  quaternary  orthogonal  type  there  exists  exactly  one  con- 
comitant of  the  two  algebraically  independent  spinors.  The  coefficients  of 
a  quaternary  orthogonal  form  of  type  [r,s]  could  be  expressed  as  the 
coefficients  of  a  polynomial,  of  degree  (r  +  s)  in  the  two  components  of  the 
spinor  of  type  [£  £],  and  of  degree  (r  —  s)  in  the  two  components  of  the 
spinor  of  type  [£  —  J]. 

The  correspondence  is  thus  exhibited  between  the  quaternary  orthogonal 
form  of  type  [rts]  and  the  double  binary  form  of  type  {r  +  s}  {r  —  s}'  . 
Concomitants  for  the  quaternary  rotation  group  are  thus  in  complete  corre- 
spondence with  concomitants  for  the  double  binary  group,  and  are  easily 
deducible  from  the  theory  of  concomitants  for  binary  forms. 

The  extension  to  forms  under  the  full  orthogonal  group  presents  no  great 
labour.  For  a  simple  quantic  of  type  (XT,  the  character  is  the  same  as  for 

the  rotation  group,  i.e.  r  -,,      r  -, 

\n\   —  \n\. 

Hence  [n]'  ®  {p}  =  [n]  ®  {#}. 

The  concomitants  are  the  same  a&  for  the  rotation  group  save  that  a  pair 
of  conjugate  concomitants,  of  types  [r,  s]  and  [r,  —  s]  respectively,  combine 
to  form  a  single  concomitant  of  type  [r,  «]',  if  s  4=  0. 

For  s  —  0  the  theory  does  not  determine  whether  a  concomitant  which 
under  the  rotation  group  is  of  type  [r]  will  be,  under  the  full  orthogonal 
group,  of  type  [r]'  or  of  the  associated  type  [r]'*.  A  method  of  distinguishing 
the  two  cases,  however,  was  described  in  "Restricted  Groups"  by  the 
evaluation  of  [A]  ®  {p}  for  the  special  transformation  of  negative  determinant 
for  which  the  matrix  is  diag  (1  1  1  —  1). 

For  an  orthogonal  form  of  type  [r,  *]',  since 


the  concomitants  are  deduced  from  the  simultaneous  concomitants  of  a 
pair  of  conjugate  forms  of  types  [r,«]  and  [r,  —  s]  respectively  under  the 
rotation  group. 

Reducibility.  To  obtain  a  finite  basis  for  the  concomitants  of  a  system  of 
ground  forms  the  concept  of  reducibility  is  necessary.  A  concomitant  is 
regarded  as  reducible  if  it  is  the  principal  part  of  the  product  of  two  con- 
comitants, or  if  it  is  a  linear  combination  of  such  principal  parts.  The 
definition  of  principal  part  is  therefore  fundamental.  For  consistency  this 
definition  for  the  rotation  group  must  be  different  from  that  for  the  full 
orthogonal  group. 

The  principal  part  of  the  product  of  forms  of  types  [p,  q]'  and  [r,  s]'  for 
the  full  orthogonal  group  is  a  form  of  type  [p  +  rtq  +  8]'.  For  the  rotation 
group,  if  q  and  s  are  positive,  consistently  with  this,  the  principal  part  of 
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the  product  of  forms  of  types  [p,q]  and  [r,s]  is  of  type  [p  +  r,  q  +  s].  By 
proceeding  to  concomitants  of  the  conjugate  types  it  is  clear  that  the 
principal  part  of  the  product  of  forms  of  types  [p,  —  q]  and  [r,  —  s]  must  be 
of  type  [p  +  r,  —  q  —  s\. 

The  product  of  forms  of  types  [p,  q]  and  [r,  —  $],  however,  has  no  analogue 
for  the  full  orthogonal  group.  The  obvious  definition  of  the  principal  part, 
however,  is  that  form  in  the  product  of  type  [p  +  r,  q  —  $],  and  this  definition 
seems  quite  satisfactory.  This  form  always  differs  from  zero,  and  in  some 
cases  it  is  the  only  non-zero  form  in  the  product.  Thus 

[11]  [1-1]  -[2]. 

An  example  will  illustrate  the  significance.  Suppose  that  in  4  variables 
the  Riemann-Chrystoffel  tensor  R^p  is  expressible  as  a  direct  product  of 
two  6  -vectors,  P  _  j  p 

^ijkp  ~  ^ij-Bkp' 

or,  to  ensure  the  correct  symmetry,  say 


Then,  if  A^  corresponds  to  a  right-handed  rotation  and  B^  to  a  left- 
handed  rotation,  both  the  conformal  curvature  tensor  and  the  total 
curvature  R  will  be  identically  zero.  The  Riemann-Chrystoffel  tensor  is 
completely  defined  by  its  Ricci  tensor  Rik  which  is  of  type  [2]. 

This  definition  of  reducibility  is  particularly  useful  as  it  corresponds 
completely  with  the  definition  of  reducibility  for  the  corresponding  double 
binary  forms. 

Examples.  The  concomitants  of  a  quaternary  orthogonal  quadratic  are 
first  considered. 

Since  [2]  =  [1  1][1  -  1]  =  [HI®  {2}[l-i]®{2}, 

the  quadratic  corresponds  to  the  double  binary  form  of  type  {2}  {2}'. 
Then 


=  ({4}  +  {2*})  ({4}'  +  {*«}')  +  {3  i}{3  1}' 

=  {*}  {4}'  +  {4}  {0}'  +  {0}  {4}'  +  {0}  {0}'  +  {2}  {2}', 

since,  the  determinant  being  unity,  the  ^-functions  {1  1}  and  {2  2}  are  taken 

to  be  unity  or  {0}. 

({2}  {2}')®  {3}  =  {2}  ©{3}  {2}'®  {3}  +  {2}®  {2  1}{2}'®{2  1} 

+  {2}®  {I3}  {2}'®  {I3} 
=  {6}  {6}'  +  {6}  {2}'  +  {2}  {6}'  +  {2}  {2}'  +  {4}  {4}'  +  {4}  {2}' 
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Thus  the  irreducible  concomitants  are  of  types : 

Degree  1;     {2}  {2}'. 

Degree  2;     {4}{0}',  {0}{4}',  {0}{0}',  {2}{2}', 

Degree  3;     {4}{2}',  {2}{4}',  {2}{2}',  {0}{0}'. 
Similarly  we  obtain: 

Degree  4;     {4}  {0}',  {0}  {4}',  {4} {2}',  {2} {4}',  {0}  {0}'. 

Hence  the  types  of  irreducible  concomitants  up  to  degree  4  for  a  quadratic 
under  the  quaternary  rotation  group  are: 

Degree  1;     [2]. 

Degree  2;  [2  2],  [2 -2],  [0],  [2]. 
Degree  3;  [3  1],  [3  - 1],  [2],  [0]. 
Degree  4;  [2  2],  [2  -2],  [3  1],  [3  -  1],  [0]. 

For  the  full  orthogonal  group  conjugate  characters  are  coupled,  e.g. 
[2  2] +  [2 -2]  =  [22]'. 

To  the  list  must  then  be  added  those  concomitants  which,  though  reducible 
under  the  rotation  group,  become  irreducible  under  the  full  orthogonal 
group.  The  types  are: 

Degree  1;  [2]'. 

Degree  2;  [2  2]',  [2]',  [0]'. 

Degree  3;  [3  1]',  [2]',  [0]'. 

Degree  4;  [2  2]',  [3  1]',  [0]',  [4]'. 

The  irreducible  concomitants  go  up  to  degree  6,  but  these  have  been 
obtained  by  other  methodsf  and  will  not  be  examined  further  here.  The 
results  are  in  agreement. 

The  quadratic  complex  is  next  considered.  Under  the  rotation  group  this 
separates  into  two  conjugate  parts  according  to  the  equation 

[22]'  =  [2  2]  +  [2 -2]. 

Consider  first  the  right-handed  form  of  type  [2  2}. 
Since  [2  2]  =  ft  $]  <g>  {4}, 


t  Littlewood(9). 
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the  concomitants  correspond  to  those  of  a  single  binary  quartic.  The 
irreducible  concomitants  are  known  to  be: 

Degree  1;     {4}. 
Degree  2;     {4},  {0}. 
Degree  3;     {6},  {0}. 

Thus  the  irreducible  concomitants  of  a  quaternary  orthogonal  form  of 
type  [2  2]  are  of  types: 

Degree  1;     [22]. 

Degree  2;     [2  2],  [0]. 
Degree3;     [3  3],  [0]. 

Corresponding  results  hold  for  the  left-handed  quadratic  complex  of 
type  [2 -2]. 

The  concomitants  of  the  complete  quadratic  complex  are  the  simulta- 
neous concomitants  of  these  two  forms.  Clearly,  since 

[pp][q-q]  =  [p+q,p-q], 

all  concomitants  which  contain  both  are  reducible.  Hence  the  complete 
system  consists  of  10  forms. 

Under  the  full  orthogonal  group  these  combine  to  form  7  irreducible 
forms,  but  to  these  must  be  added  10  other  forms  which,  reducible  under  the 
rotation  group,  are  yet  irreducible  under  the  full  linear  group.  These  are: 

Degree  2;     [4]'  from  [2  2]  [2 -2]. 

Degree3;     [4]'  from  [2  2]  [2 -2], 

[4]'  from  [2  2]  [2 -2]. 

Degree  4;     [4]'  from  [2  2]  [2  -  2], 

[51]'  from  [3  3]  [2 -2]. 

Degree  5;     [5  1]'  from  [3  3]  [2  -  2], 

[71]'  from  [3 -3]  [22]  [22]. 

Degree  6;     [7  1]'  from  [3  -  3]  [2  2]  [2  2], 

[6]'  from  [3  3]  [3 -3]. 

Degree  7;     [7  1]'  from  [3  -  3]  [2  2]  [2  2]. 

For  partitions  in  two  parts,  one  only  of  the  conjugate  products  is  given. 
The  products  corresponding  to  [3  3]  [3  3]  [2  -  2]  [2  -  2]  [2  -  2]  would  have 
been  eligible  for  this  list  except  for  the  syzygy  which  connects  the  square 
of  the  form  of  type  [3  3],  or  correspondingly  the  square  of  the  sextic 
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concomitant  of  the  binary  quartic,  with  other  reducible  forms.  This  syzygy 
establishes  the  reducibility  of  the  corresponding  invariants.  Thus  the 
complete  list  of  irreducible  concomitants  of  a  quadratic  complex  under  the 
full  quaternary  orthogonal  group  consists  of  the  1 7  forms : 

Degree  1;  [22]. 

Degree  2;  [2  2],  [0],  [0]*,  [4]. 

Degree  3;  [3  3],  [0],  [0]*,  [4],  [4]*. 

Degree  4;  [5  1],  [4]. 

Degree  5;  [7  1],  [5  1]. 

Degree  6;  [7  1],  [6]. 

Degree  7;  [71]. 

Since  the  binary  sextic  has  26  irreducible  concomitants,  the  right-handed 
and  left-handed  cubic  complexes  under  the  quaternary  rotation  group  have 
each  26  irreducible  concomitants.  The  complete  cubic  complex  has  thus  52. 
Under  the  full  orthogonal  group,  because  of  the  difference  of  reducibility 
there  are  several  hundreds  more. 

The  determination  and  enumeration  of  these  is  quite  straightforward 
and  only  laborious  because  they  are  so  numerous.  It  is,  however,  necessary 
to  know  the  syzygies  of  the  binary  sextic  in  order  to  determine  which  are 
reducible. 

The  spinors  are  next  considered.  The  case  of  the  basic  spinor  is  knownf 
for  the  full  orthogonal  group.  Under  the  rotation  group,  since  the  basic 
spinor  corresponds  to  the  double  binary  form  {1} +  {!}',  it  is  obvious  that 
all  concomitants  except  the  spinor  itself  are  reducible.  The  only  case  of 
a  concomitant  reducible  over  the  rotation  group  but  irreducible  over  the 
full  orthogonal  group  is  clearly  that  corresponding  to 

[*«[*-«-[!]. 

so  that  the  4- vector  of  degree  2  in  the  spinor  is  the  one  irreducible  concomi- 
tant over  the  full  orthogonal  group. 

The  spinor  of  type  [f  f  ]  corresponds  to  the  binary  cubic.  The  irreducible 
concomitants  of  a  binary  cubic  are  as  follows: 

Degree  1;  {3}. 

Degree  2;  {2}. 

Degree  3;  {3}. 

Degree  4;  {0}. 

t  Littlewood(lO). 
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There  is  a  sextic  syzygy  of  degree  6.  The  irreducible  concomitants  of  a  right- 
handed  spinor,  and  those  of  a  left-handed  spinor,  are  therefore  as  follows : 
Degree  1;     [f  f].  Degree  1;     [|-f]. 

Degree  2;     [1  1].  Degree  2;     [1  - 1]. 

Degree3;     [f  f].  Degree  3;     [f  f]. 

Degree  4;     [0].  Degree  4;     [0]. 

These  are  the  8  irreducible  concomitants  of  the  complete  spinor  under 
the  rotation  group.  For  the  full  orthogonal  group  must  be  added: 
Degree  2;     [3]'        from     [f  f  ]  [f  -  f  ]. 
Degree  3;     [f  £]'     from     [f  f]  [1  —  1]. 
Degree  4;     [3]'        from     [ff][f-f], 
[3]'        from     [|-|][||], 
[2]'        from     [1  1]  [1  -  1]. 
Degree  5;     [f  J]      from     [f  f]  [1  —  1], 

[f  J]      from     [1  1]  [1  1]  [|  —  f ]. 
Degree  6;     [3]         from     [|f][|-|]. 

The  complete  list  of  irreducible  concomitants  of  a  spinor  of  type  [f  f]' 
under  the  full  orthogonal  group  thus  contains  14  forms  of  the  following 
tyP68:  Degree  1;  [f  f]'. 

Degree  2;     [1  1]',  [3]'. 

DegreeS;     [f  f]',  [f  J]'. 

Degree  4;     [3]',  [3]'*,  [2]',  [0]',  [0]'*. 

Degree5;     [f  £]',  [}  J]'. 

Degree  6;     [3]'. 

Degree?;     [fj]'. 

Lastly  the  spinor  of  type  [f  J]  is  considered,  for  which  the  calculation 
is  much  more  complex.  The  corresponding  double  binary  character  is 
{2}{1}'.  Hence 


({2}{1}')®{3}  =  {2}®  {3}  {3}'  +  {2}®  {2  1}{2  1}' 


({2}  {!}')  ®  {4}  -  {2}  ®  {4}  {4}'  +  {2}  ®  {3  1}  {3  1}'  +  {2}  ®  {2*}  {2*}' 
=  {«}  W  +  {*}  {*}'  +  {0}  {4}'  +  {6}  {2}'  +  {4}  {2}' 


({2}  {!}')  ®  {5}  =  {10}  {5}'  +  {6}  {5}'  +  {2}  {5}'  +  {8}  {3}'  +  {6}  {3}'  +  {4}  {3}' 


364  D.  E.  LITTLEWOOD  [13  Dec. 

Deleting  reducible  concomitants,  we  see  that  the  types  of  irreducibles  are: 
Degree  2;     {2}  {0}'  +  {0}  {2}'. 
Degree  3;     {2}  {I}'. 
Degree  4;     {0}  {0}'. 

The  irreducible  concomitants  of  the  right-hand  spinor  of  type  [f  J]  are 

therefore:  -^  ,       ra  n 

Degree  1;     [f  J]. 

Degree  2;     [1  1]  [1  -  1]. 
Degree  3;     [f  J]. 
Degree  4;     [0]. 

This  set  is  complete  for  ail  degrees. 

There  are  5  corresponding  concomitants  of  the  left-handed  spinor.  The 
simultaneous  concomitants  of  the  complete  spinor  however  are  not  all 
reducible,  even  for  the  rotation  group,  as  they  were  for  the  other  spinors 
considered.  The  following  analyses  give  the  concomitants  for  the  respective 
degrees  in  the  two  parts  of  the  spinor;  1-f-l,  2-fl,  3+1,  2  -f  2  : 

{2}  {!}'  {1}  {2}'  =  {3}  {3}'  +  {3}  {!}'  +  {1}  {3}'  +  {1}  {!}', 

({2}  {1}')®  {2}  {1}  {2}'  =  ({4}  {2}'  +  {0}  {2}'  +  {2}  {0}')  {1}  {2}' 
=  {5}  {4}'  +  {3}  {4}'  +  {5}  {2}'  +  {3}  {2}' 


({2}  {!}')  ®  {3}  {1}  {2}'  =  {7}  {5}'  +  {7}  {3}'  +  {7}  {!}'  +  {5}  {5}'  +  {5}  {3}' 

+  {5}{l}'  +  {3}{5}'  +  {3}{3}'  +  {3}{l}' 
+  {1}  {5}'  +  {1}  {3}'  +  {!}{!}'  +  {5}  {3}' 


+  {3}  {!}'  +  {!}  {3}'  +  {!}{!}', 

({2}  {1}')®  {2}  ({1}  {2}')®  {2}  =  {6}  {6}'  +  {6}  {4}'-f  {6}  {2}'  +  {4}  {6}'  +  {4}  {4}' 

+  {4}  {2}'  +  {2}  {6}'  +  {2}  {4}'  +  {2}  {2}' 


+  {4}  {2}'  +  {2}  {2}'  +  {2}  {6}'  +  {2}  {4}' 
+  {2}  {2}'  +  {4}  {4}'  +  {2}  {4}'  +  {0}  {4}' 
+  {0}  {4}'  +  {0}  {2}'  +  {0}  {0}'  +  {4}  {0}' 


1945]  INVARIANT  THEORY  UNDER  ORTHOGONAL  GROUPS  365 

Deleting  the  reducible  concomitants,  we  obtain  for  the  irreducibles  : 
Degree  1  +  1;     {3}  {!}',  {1}  {3}',  {1}  {!}'. 
Degree  2  +  1  ;     {5}  {0}',  {3}  {0}',  {1}  {0}',  {1}  {2}',  {1}  {2}'. 
Degree  3  +  1  ;     {3}  {!}',  {1}  {3}',  {1}  {!}',  {1}  {!}'. 
Degree  2  +  2;     {4}  {0}',  {0}  {4}',  {2}  {0}',  {0}  {2}',  {0}  {0}',  {0}  {0}'. 

The  irreducible  concomitants  of  degrees  4+1  and  3  +  2  are  obtained 
similarly.  The  list  of  irreducibles  up  to  degree  5  in  the  coefficients  for  the 
complete  spinor  of  type  [f  J]  +  [f  —  J]  over  the  rotation  group  are  as  follows  : 


Degree  1  +  0; 
0+1; 
2  +  0; 

H-H. 
[i  i],  [i 

-1]. 

0  +  2; 

[ii], 

[i 

-1]. 

i  +  i; 

[21], 

[2 

-1], 

[i] 

3  +  0; 

[f  |]. 

2  +  1; 

[ttl, 

ti 

|]>[* 

E 

.[!-«.[!-«• 

1  +  2; 

[f-i 

•],[f-fi,[ 

*-«. 

[f  «,  [f  «• 

0  +  3; 

»-4 

4  +  0; 

[0]. 

3+1; 

[21], 

[2 

-1], 

[!],[!]- 

2  +  2; 

[22], 

[2 

-2], 

[1 

!],[!- 

-1],[0],[0]. 

1  +  3; 

[21], 

[2 

-1], 

[1] 

,[!]• 

0  +  4; 

[0]. 

4  +  1; 

[if], 

[f 

-M, 

[* 

*]• 

3  +  2; 

[f-E[f-f 

[LI 

I-E 

»-«,»-«.»«• 

2  +  3; 

[if], 

[f 

SMI 

:f] 

,  ti  *]. 

[*«,»-«• 

1  +  4; 

ti-i 

f], 

H«. 

[* 

-«• 

There  are  further  irreducibles  of  degree  6  and  more. 

5.    Five  variables 

The  orthogonal  group  in  5  variables  has  a  basic  spin  representation  of 
degree  4.  The  invariant  matrices  of  this  spin  matrix  are  now  examined. 
The  latent  roots  of  the  5-dimensional  orthogonal  matrix  may  be  expressed 

1. 
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The  latent  roots  of  the  spin  representation  aref 

giiXa-f/?),       et(a-/?)j       eit(yff-<x)> 

Hence 


i  i]  ®  I2}  = 

=  [1  1]. 
il  ®  {I2}  =  e*a  + 


It  follows  that  the  spin  representation  has  an  invariant  linear  complex. 
Thus  the  group  of  the  spin  transformations  is  either  the  4-dimensional 
symplectic  group  or  else  some  sub-group  of  this.  Since  every  character  of 
a  group  is  also  a  character,  simple  or  compound,  of  its  sub-group,  it  follows 
that  the  operation  ®  may  be  followed  by  a  character  of  the  4-dimensional 
symplectic  group.  Such  a  character  is  denoted  by  (A)  =  (A^  A2>. 

Thus  [i«©<!  !>  =  [!]. 

By  considering  the  principal  parts  involved,  it  is  clear  that  [£  4]  <g)  <£>,  #> 
includes  the  character  [&(p  +  q),  $(p  —  g)]  of  the  orthogonal  group.  The 
number  of  terms  involved  may  be  determined^  by  the  evaluation  of  (p,  q) 
and  \$(p  +  q),  \(p  —  <?)]  f°r  the  identical  transformation.  Using  the  formula 

n&rii  (&?-&!) 


where 

it  follows  that 


Again,  for  the  orthogonal  group,  the  formula 

n[(aj>-ag)(ap4-gg  +  n~2 
L  1>*"'    *J 


where  as  =  Ag  —  54-  1,  is  used,  which  holds  also  for  spinors.  This  gives 


t  Littlewood(S),  Chap.  xi. 
t  Littlewood(6,  7). 
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Since  [J  £]  ®  <p,  q)  includes  [  %(p  +  q),  %(p  —  q)~\  and  has  the  same  number 
of  terms,  it  follows  that  the  two  representations  are  identical,  and 

[HI®  <P>4>  =  HKp  +  flf),  *(*>-«)]• 

Again,  since  [£  4]  (8)  <1  1>  =  [1], 

it  follows  that 


Thus  <^,<7>  =  < 

The  5  -dimensional  orthogonal  group  is  thus  isomorphic  with  the  4-dimen- 
sional  symplectic  group.  The  isomorphism  is  not  simple  but  2:1,  since  the 
two  -valued  spinors  correspond  to  simple  representations  of  the  symplectic 
group. 

This  isomorphism  does  not  give  a  markedly  easier  method  of  finding 
concomitants  under  the  5-dimensional  orthogonal  group  as  do  previous 
results  in  fewer  variables.  It  is  true  that  the  4-dimensional  symplectic 
group  is  easier  to  deal  with  than  the  5-dimensional  orthogonal,  but  to 
counterbalance  this  the  degrees  of  the  corresponding  forms  are  higher  for 
the  symplectic  group. 

The  advantages  are  about  equally  divided.  For  some  forms  the  symplectic 
representation  is  easier,  for  others  the  orthogonal.  But  the  labour  of  finding 
the  concomitants  is  at  least  halved.  If  the  concomitants  of  a  form  or  system 
of  ground  forms  is  known  for  the  one  group,  then  the  concomitants  of  the 
corresponding  form  or  set  of  forms  in  the  other  group  is  immediately 
deducible. 

Thus,  in  "Restricted  Groups"  the  complete  set  of  concomitants  of  a 
quaternary  symplectic  quadratic  was  obtained.  The  types  are  as  follows  : 

Degree  1;  <2>. 

Degree  2;  <2*>,  <!»>,  <0>. 

Degree  3;  <2). 

Degree  4;  <3  1),  <0>. 

From  these  may  be  deduced  the  complete  system  of  concomitants  of 
a  linear  complex  under  the  orthogonal  group  in  5  variables.  The  types  are 
as  follows:  Degree  1;  [I2]. 

Degree  2;     [2],  [1],  [0]. 

Degree  3;     [I2]. 
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On  the  other  hand,  in  the  same  paper  the  concomitants  of  a  5-dimensional 
orthogonal  quadratic  were  determined  up  to  degree  5  in  the  coefficients, 
the  types  being: 

Degree  1;     [2]. 

Degree  2;     [2*],  [2],  [0]. 

DegreeS;     [3  1],  [22],  [2],  [0]. 

Degree  4;     [3  2],  [3  1],  [2*],  [2»],  [2],  [0]. 

Degree  5;     [4  3],  [3  2],  [3  1],  [3  1],  [2«],  [0]. 

The  concomitants  of  the  quaternary  symplectic  quadratic  complex  were 
obtained  up  to  degree  3  only  in  the  coefficients: 

Degree  1;     <22>. 
Degree  2;     <4>,  <22>,  <0>. 
Degree  3;     <4  2),  <4>,  <22>,  <0>. 

The  extensions  to  degrees  4  and  5  in  the  variables  follow  immediately  from 
the  results  for  the  quadratic: 

Degree  4;     <5  1>,  <4  2),  <4>,  <4>,  <22>,  <0>. 
Degree  5;     <7  1>,  <5  1>,  <4  2>,  <4  2>,  <4>,  <0>. 

The  advantages  of  the  symplectic  representation,  however,  are  much 
more  in  evidence  when  dealing  with  5-dimensional  spinors,  for  these  present 
considerable  difficulty  by  other  methods. 

The  concomitants  of  a  basic  spinor  are  known.  f  The  spinor  of  type  [f  J] 
corresponds  to  the  symplectic  form  of  type  (2  1>.  This  is  not  an  easy  form 
to  deal  with,  but  the  types  of  concomitants  can  be  found  by  means  of 
an  artifice,  using  the  results  for  the  corresponding  orthogonal  form  in 
4  variables. 

Denote  for  the  present  the  characters  of  the  4-dimensional  orthogonal 
and  rotation  groups  by  [A]'  and  [A]  respectively. 

Then,  since  <2  1>  =  {2  !}-{!}  =  [2  1]', 

it  follows  that  <2  1>  <g>  {n}  =  [2  1]'  ®  {n}. 

The  orthogonal  character  [21]'  corresponds  to  the  double  binary  character 

{3}  {!}'  +  {!}  {3}'. 
Hence 

({3}  {!}'  +  {!}  {3}')®  {2}  =  ({3}  {!}')©  {2}  +  ({l}  {3}')®  {2}  +  {3}  {!}'  {1}  {3}' 
=  {6}  {2}'  +  {2}  {2}'  +  {4}  {0}'  +  {0}  {0}'  +  {2}  {6}' 


+{4}  {4}'  +  {4}  {2}'  +  {2}  {4}'  +  {2}  {2}'. 


t  Littlewood(lO). 
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Thus  [2  1]'  ®  {2}  =  [42]  +  [2]  +  [22]-*-  [0]  +  [4  -  2]  +  [2] 

+  [2  _2]  +  [0]  +  [4]  +  [3  l]  +  [3  - 
=  [4  2]'  +  [2  2]'  +  [3  1]'  +  [4]'  +  3[2]'  +  2[0]'. 
Since  {4  2}  -  [4  2]'  +  [4]'  +  [3  1]'  +  [22]'  +  2[2]'  +  [0]', 


it  follows  that        <2  1>  ®  {2}  =  [2  1]'  <g>  {2} 

-{4  2}  +  {2} 


Similarly 

({3}  {! 

=  ({3}  {!}')  ®  {3}  +  ({3}  {!}')  ®  {2}  {1}  {3}'  +  {3}  {!}'  ({1}  {3}')  ®  {2} 


=  W  {3}'  +  {5}  {3}'  +  {3}  {3}'  +  {7}  {!}'  +  {5}  {!}'  +  {3}  {!}'  +  {1}  {!}' 
+  {7}  {5}'  +  {7}  {3}'  +  {7}  {!}'  +  {5}  {5}'  +  {5}  {3}'  +  {5}  {!}' 
+  {3}  {5}'  +  {3}  {3}'  +  {3}  {!}'  +  {1}  {5}'  +  {1}  {3}'  +  {1}  {!}' 
+  {5}  {3}'  +  {3}  {3}'  +  {1}  {3}'  +  {5}  {7}'  +  {3}  {7}'  +  {1}  {7}' 
+  {5}  W  +  {3}  {5}'  +  {1}  {5}'  +  {5}  {3}'  +  {3}  {3}'  +  {1}  {3}' 
+  {5}  {l}'  +  {3}  {!}'  +  {!}  {l}'  +  {3}  {5}'  +  {3}  {3}'  +  {3}  {!}' 
+  {3}  {»}'  +  {3}  {5}'  +  {3}  {3}'  +  {1}  {7}'  +  {1}  {5}'  +  {1}  {3}' 


Hence 

[2  1]'  ®  {3}  =  [6  3]'  +  [4  1]'  +  2[3]'  +  [4  3]'  +  [3  2]'  +  [2  1]'  +  2[1]'  +  [6  1]' 
+  [5  2]'  +  [4  3]'  +  2[5]'  +  [4  1]'  +  [3  2]'  +  [4  1]'  +  2[3]7 
+  [2  I]7  +  [3  2]'  +  [2  1]'  +  2[1]'  +  [4  1]'  +  2[3]'  +  [2  1]' 
=  {6  3}  +  {4  3}  +  {5}  +  {4  l}  +  {3  2}~{1} 


The  irreducible  concomitants  up  to  degree  3  in  the  coefficients  of  the 
symplectic  form  of  type  <2  1>  are  thus  of  the  following  types: 

Degree  1;     <2  1>. 
Degree  2;     <3  1>,  <2>,  <2>. 

Degree  3;     <6>,  <4  3>,  <3  2>,  <3  2),  <3>,  <3>,  <2  1>,  <2  1>. 
SUB.  2.     voi,.  60.     NO.  2402.  Y 
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The  irreducible  concomitants  up  to  degree  3  in  the  coefficients  of  the 
5-dimensional  spinor  of  type  [f  £]  are  thus  of  types  : 
Degree  1;     [fj]. 
Degree  2;     [2  1],  [1  1],  [1  1]. 
Degree  3;     [f  f  ],  [|  «,  [f  fl,  [f  «,  [|  f  ],  [|  |],  [|  fl,  [f  «. 

The  spinor  of  type  [f  f  ]  gives  a  much  simpler  calculation,  as  it  corresponds 
to  the  symplectic  cubic. 

Since  <3>  =  {3}, 

then  {3}®  {2}  =  {6}  +  {4  2} 


2}  +  {6  3}  +  {5  2  2}  +  {4  4  1}. 
The  ^-function  {5  2  2}  is  expressed  in  terms  of  $  -functions  with  ^  2  parts 
by  the  method  described  in  "Restricted  Groups".  Thus 

{5  2  2}  =  {3;  2}  =  {5}  +  {4  l}  +  {3  2}-{3}-{2  1} 


Also  {441}  =  {4  3}  =  <4  3>-h<3  2>  +  <2  !>  +  <!>. 

Thus 


+  <4  l>  +  <3  2>  +  <4  3>  +  <3  2>  +  <2 
Skailarly 

<3>®{4}  =  <1  2>  +  <102>  +  <93>  +  <9  l>  +  <8  4>  +  <8  2>  +  2<8>  +  2<7  3> 
4-3(7  l>  +  <6  6>  +  <6  4>  +  5<6  2>  +  2<6>  +  <5  5>  +  3<5  3> 
+  4<5  1>  +  2<4  4)  +  4<4  2>  +  5<4>  +  3<3  3>  +  3<3  1> 
+  3<22>  +  <l  1>  +  <0>. 

The  types  of  irreducible  concomitants  up  to  degree  4  are: 

Degree  1;  <3>. 

Degree  2;  <4  2>,  <3  1>,  (2>. 

Degree  3;  <6  3>,  <5  2),  <5>,  <4  3>,  2<4  1>,  2<3  2>,  <3>,  <2  1>,  <1>. 

Degree  4;  <6  6>3  <6  4>,  <6  2>,  <6>,  <5  5>,  3<5  3>,  2<5  1>,  2<4  4>, 

4(4  2>,  <4>,  3<3  3>,  3<3  1>,  3<2  2>,  <1  1>,  <0>. 

The  types  of  irreducible  concomitants  of  a  5-dimensional  spinor  of  type 
[f  f  ]  up  to  degree  4  in  the  coefficients  are  : 

Degree  1;     [f  f]. 
Degree  2;     [3  1],  [2  1],  [1  1]. 

Degree  3;     [f  f],  [f  |],  [f  f  ],  [|  £],  2[f  f]5  2[f  J],  [f  f],  [f  £],  [*  «. 
Degree  4;     [6],  [5  1],  [4  2],  [3  3],  [5],  3[4  1],  2[3  2],  2[4],  4[3  1],  [2  2], 

3[3],  3[2  1],  3[2],  [1],  [0]. 
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6.    Six  variables 
If  the  latent  roots  of  an  orthogonal  matrix  in  6  variables  are 

e±i<*,     e&P,     e±i>yj 
then  the  latent  roots  of  the  complete  basic  spin  matrix  are 


For  the  rotation  group  the  complete  spin  representation  which  is  of  degree 
8  separates  into  two  conjugate  representations  each  of  degree  4,  according 
to  the  equation  [i  HI'  =  [*  i  «  +  EH  -«• 

The  latent  roots  of  one  which  is  associated  with  [£•£•}],  may  be  expressed  as 


Hence      [J  H]  ®  {2}   =  e««+*+?>  +  e«a-*-r>  +  e^-r-«>  +  e«r-«- 
+  e*«  +  e 

=  [1  1  1], 
[J  \ 

[J  i 

=  [**-«- 

From  a  consideration  of  principal  parts,  it  is  clear  that  [-J-  J  -J]  ®  {^>,  g,  r} 
contains  [%(p  -\-g_-r),  %(p  —  q  +  r),  %(p~-  q  —  r)].  Since  it  is  readily  verified 
that  the  number  of  terms  is  the  same  in  each  case,  equality  follows,  and 

^^^®{pqr}  =  [^p  +  q-r))^p-q  +  r)^(p-q-r)].  (6-1) 

Again,  since  ti  H]  <8>  {I2}  «  [1] 

and  the  quaternary  linear  complex  has  a  quadratic  invariant 


it  follows  that  the  operation  ®  may  be  followed  by  an  orthogonal  character, 


Thus        {P}®  [J(p  +  q-r),  ±(p-q  +  r),  ^p-q-r)]  =  {p  q  r} 
or  {I2}®[u  v  w]  =  {u  +  v,  u  —  w,  v  —  w}.  (6*2) 

Thus  is  established  a  complete  correspondence  between  the  characters, 
including  spin  characters,  of  the  6-dimensional  rotation  group  and  those  of 
the  4-dimensional  unitary  group.  An  isomorphism,  which  is  2  :  1  because 
of  the  2-valued  nature  of  the  spinors,  between  the  two  groups  follows. 
A  complete  correspondence  exists  between  the  invariant  theories  of  the 
two  groups. 

Y2 
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This  isomorphism  has  been  used  by  Turnbullt  to  obtain  the  concomitants 
of  the  quaternary  quadratic  complex  from  those  of  the  orthogonal  quadratic 
in  6  variables.  It  was  also  mentioned  in  "Restricted  Groups",  but  the  full 
implications,  in  particular  the  position  of  the  spinors,  were  not  discussed. 

As  with  the  orthogonal  groups  in  5  variables,  the  relative  advantages  of 
the  two  groups  depend  on  the  particular  type  considered.  Thus,  as  in  the 
case  considered  by  Turnbull,  the  6-dimensional  orthogonal  quadratic  leads 
to  a  simpler  calculation  than  the  quaternary  quadratic  complex.  On  the 
other  hand  the  reverse  procedure  is  sometimes  preferable,  particularly  in 
considering  spinors. 

A  form  of  type  [I3]  under  the  6-variable  rotation  group  corresponds  to 
a  quaternary  quadratic  for  which  the  irreducible  concomitants  are  of  types: 

Degree  1;     {2}. 
Degree  2;     {22}. 
Degree  3;     {23}. 
Degree  4;     {0}. 

Thus  the  types  of  irreducibles  for  a  form  of  type  [I3]  are: 
Degree  1;     [11  1]. 
Degree  2;     [2]. 
Degree3;     [1  1  -1]. 
Degree  4;     [0]. 

This  is  complete  for  all  degrees. 

The  extension  to  the  full  orthogonal  form  of  type  [1  1  1]'  is  straight- 
forward but  will  not  be  considered  here. 

The  spinor  of  type  [f  f  f  ]  corresponds  to  a  quaternary  cubic,  for  which 
the  types  of  concomitants  up  to  degree  6  have  been  found. f  From  these 
results  the  types  of  irreducible  concomitants  of  the  spinor  may  be  written 
down: 

Degree  1;  [f  }  f]. 

Degree  2;  [3  1  1], 

Degrees;  [|f  f],  [||*],  [|i -«. 

Degree  4;  [6],  [4  3  1],  [4  2],  [23],  [3  1],  [2  2  -  2]. 

Degree  5;  ft  f  |],  [|  f  J],  [J  f  f],  [Jf  |  -  «,  [|  }  J],  [f  f  «, 

S  I  f],  [|  f  -  f].  tt  *  «.  »  f  -«,  [f  f  -*]• 

Degree  6;  [6  4  1],  [7  2],  [5  2  2],  [5  3  2],  [4  4  1],  [4  3  2],  [6  1],  [63-  1], 
[5  2],  [5  1  1],  [4  3],  [4  2  1],  [3  3  1],  [5  1  -  1],  [4  2  -  1], 
[4  1],  [3  2],  [2  2  1],  [3  3  -3],  [3  1  - 1],  [1  1  1]. 

f  Turnbull(13).  J  Littlewood(8). 
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The  spinor  of  type  [f  J  J]  corresponds  to  {2  1}.  Since  in  4  variables 

{2  1}®  {2}  =  {4  2}  +  {3  2  !}  +  {23}-h{3  P}, 

{2  1}®{3}  =  {6  3}  +  {5  3  l}4-{4  3  2}-f  {5  2 

4-2(4  22  l}  +  {4  3  !2}  +  {32  2 
it  follows  that  for  the  6-variable  rotation  group 
[1*11®  {2}  =  [3  1  l]  +  [2  !]  +  [!  1  -!]  +  [!  1  1], 


The  irreducible  concomitants  are  of  types  : 

Degree  1;     [|  J  J]. 

Degree  2;     [2  1],  [1  1  -  1],  [1  1  1]. 

Degree  3;     [f  f  £],  [H  -  «,  2[f  f  «,  [f  i  *],  [|  i  -  «,  [i  i  J],  [f  I  -  §]. 

These  last  two  systems  are  not  complete;  the  irreducibles  continue  for 
considerably  higher  degrees. 

7.    Seven  and  eight  variables 

The  case  of  eight  variables  is  first  considered.  There  are  two  conjugate 
basic  spin  representations  of  the  rotation  group  in  8  variables,  each  of 
degree  8,  and  these  each  give  automorphisms  of  the  rotation  group. 

Let  the  latent  roots  of  the  orthogonal  matrix  in  8  variables  be 

e±*«,     e±^,     e±*y,     e±is. 
Then  the  latent  roots  of  the  complete  basic  spin  matrix  are 


For  the  rotation  group  the  spin  representation  separates  into  two  conj  ugate 
representations,  according  to  the  equation 


the  latent  roots  corresponding  to  the  one  being  those  which  have  an  even 
number  of  negative  signs,  and  corresponding  to  the  other  an  odd  number  of 
negative  signs. 

Put  a'  =  i 
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Then  the  latent  roots  corresponding  to  [£  £  J  £]  are 


It  is  convenient  to  denote  [i  4  4  4]  by  A,  and  [4  4  4  ~  4]  by  A. 
Clearly 

A  ®  {2}  =  e 


-«a-^  -f  4 


Hence  the  spin  representation  has  a  quadratic  invariant.  It  is  thus  an 
orthogonal  group  and  possesses  a  pair  of  conjugate  basic  spin  representa- 
tions. The  latent  roots  of  the  representation  corresponding  to  [^  \  \  41  are 


where 


The  character  of  the  representation  is  thus  the  character  [J  i  4  —  il  °f  ^ne 
original  orthogonal  group.  Thus 

A®  A  =  A. 
Similarly  aw  =  \(QL"  +  ft"  +  y"  +  S")  =  a, 


and  A®A®  A  =  A®  A  =  A®  A  =  [1]. 

Now  A  ®  [1]  =  [H  i  iL 

A®  [P]  =[11], 


A®  [4  H  -«  =  [!]• 

It  follows  from  a  consideration  of  principal  parts  that 
A  <S)  [#  q  r  s]  —  [u  v  w  z], 
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Thus    A  <g)  [> 


r-s), 


The  operation  A  <g>  is  therefore  equivalent  to  the  operation  of  the  matrix 

111-1 
11-11 
1-111 

Li    _i    _i    _U 

on  the  column  vector  whose  4  components  are  the  4  parts  p,  q,  r,  s  of  the 
partition  defining  the  representation. 

The  operation  A  ®  corresponds  to  the  matrix 

1111 
1  1-1-1 
1-1  1-1 
1  1  1  -1J 

Every  rotation  group  with  an  even  number  of  variables  possesses  an 
automorphism  which  interchanges  conjugate  representations.  For  8  vari- 
ables this  corresponds  to  the  matrix 

J  = 


.      .    -1J 

operating  on  the  column  vector  of  the  4  parts. 
Since  JM  =  M*  J, 

JM2  =  MJ, 

it  is  clear  that  the  group  generated  by  J  and  M  is  the  symmetric  group  of 
order  3'.  Hence  the  group  of  outer  automorphisms  of  the  rotation  group 
in  8  variables,  or  at  least  of  its  covering  group,  is  the  symmetric  group  of 
order  3!. 

For  v  ^  3,  the  whole  group  of  automorphisms  of  the  rotation  group  in  2v 
variables  is  the  full  orthogonal  group,  but  for  v  =  4  it  is  a  larger  group  which, 
it  will  be  shown,  is  not  expressible  as  a  linear  group  in  less  than  24  variables. 
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The  characters  are  obtained  from  the  characters  of  the  rotation  group  as 
follows.  Let  A  denote  the  partition  [1],  B  the  partition  [•&£££•],  C  the 
partition  [£££—£]  and  X  the  partition  [11].  Then  every  partition  may  be 
expressed  in  the  form 


where  p,  q,  r,  s  are  positive  or  zero  integers. 

There  is  a  character  of  the  whole  group  of  automorphisms  corresponding  to 


+  [qA  +  rB  +pC  +  sX]  +  \r 

provided  that  p,  q,  r  are  all  distinct. 
If,  however,  p  =  q  4=  r,  corresponding  to 


sX]  +  [rA  +qB+pC+  sX], 


there  are  two  associated  characters,  in  the  second  of  which  the  sign  is 
changed  for  every  element  which  corresponds  to  one  of  the  outer  auto- 
morphisms J,  M  J,  or  JM  . 

For  p  =  q  =  r,  corresponding  to 

[pA+pB+pC  +  sX] 

there  are  three  characters,  which  are  obtained  from  the  first  of  them  by 
multiplying  by  the  characters  of  the  symmetric  group  of  order  3  !  ,  according 
to  the  outer  automorphism  to  which  the  group  element  belongs,  i.e. 


/ 

J,  MJ,  JM 

MM* 

1 

2 
1 

1 
0 
j 

1 
-1 

1 

The  lowest  degree  of  the  characters,  apart  from  those  which  correspond  to 
the  partition  [0]  and  thus  give  only  the  characters  of  the  group  of  outer 
automorphisms,  is  24  which  is  the  degree  of  [IJ-f-A-fA.  Hence  a  repre- 
sentation of  the  group  as  a  linear  group  must  have  a  minimum  degree  of  24. 

Concerning  the  applications  to  invariant  theory,  the  group  itself  cannot 
be  simplified  as  the  isomorphism  is  an  automorphism.  Concomitants  of 
some  forms,  however,  may  be  obtained  from  those  of  other  forms  whose 
types  are  easier  for  calculation. 

Thus,  since  A®  [2]  =  [I4], 

the  concomitants  of  a  form  of  type  [I4]  may  be  obtained  from  those  of  a 
quadratic.  Following  the  method  described  in  "Restricted  Groups",  the 
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irreducible  concomitants  up  to  degree  4  of  an  8-variable  orthogonal  quad- 
ratic are  found  to  be  of  the  following  types: 

Degree  1;  [2]. 

Degree  2;  [22],  [2],  [0]. 

Degree  3;  [3  1],  [23],  [22],  [2],  [0]. 

Degree  4;  [3  2  1],  [3  1],  [2*],  [2*1,  [2*],  [2*],  [2],  [0]. 

Thus  the  types  of  irreducible  concomitants  up  to  degree  4  of  a  form  of  type 
[I4]  are: 

Degree  1;     [I4]. 

Degree  2;     [2*],  [I4],  [0]. 

Degree  3;     [22  I2],  [3  I2  -  1],  [22],  [I4],  [0]. 

Degree  4;     [3  2  1],  [22  p],  [4],  [3  I2  -  1],  [22],  [22],  [I4],  [0]. 

The  method  may  be  employed  to  convert  spinors  into  true  representations. 
The  case  of  the  basic  spinor  of  type  A  is  of  course  trivial,  as  it  corresponds 
to  a  simple  vector  of  type  [1],  for  which  the  only  irreducible  concomitant 
is  the  quadratic  invariant.  The  complete  basic  spinor  of  type  A  +  A  has  been 
considered  elsewhere,  f 

Consider  the  spinor  of  type  [f  -J  J  J],  Since 


the  concomitants  of  the  spinor  can  be  deduced  from  those  of  a  form  of 
type  [1  1  1]. 
Then 

[1  1  1]®{2}  =  {1  1  1}®{2} 

=  {222}  +  {2  I4} 

=  [222]  +  [22]  +  [2]  +  [0]  +  [2  i  1]  +  [14], 
(1  1  l}®{3}  =  {333}  +  {322  1  l}  +  {23  !3}  +  {3  I6} 

=  [333]  +  [32  l]  +  [322]  +  [32  1  1]  +  [31  l]  +  [3  1  l]  +  [3] 

+  [2  2  2  1]  +  3[2  2  1]  +  2[2  11]  +  2[2  1]  +  3[1  1  1]  -f-  [1], 
{1  1  1}  <x>  {4}  -  {43}  +  {4  32  I2}  -f  {4  24}  +  {4  22  I4}  +  {33  I3}  +  {32  22  I2} 


2  l]  +  [43  l]  +  3[422] 
+  [42  I2]  +  2[4  2]  +  2[4  I2]  +  2[4]  +  [33  1]  +  [32]  +  3[33  2] 
+  3[32  I2]  +  4[3  22  1]  +  8[3  21]  +  7[3  I3]  +  3[3  1]  +  2[24] 
+  7[23]  +  4[22  I2]  +  8[22]  +  10[2  I2]  +  4[2]. 

t  Littlewood(lO). 
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If  we  delete  reducible  concomitants,  the  list  of  irreducibles  up  to  degree  4 
is  as  follows: 

Degree  1;     [111]. 

Degree  2;     [22],  [2  I2],  [2],  [I6],  [0]. 

Degree  3;     [3  2  I2],  [3  I2],  [3],  3[22  1],  2[2  I3],  2[2  1],  2[13],  [1], 

Degree  4;     [4  2  I2],  [4  2],  [4],  [32],  3[32  I2],  [3  22  1],  6[3  2  1], 

5[3  I3],  3[3  1],  [24],  4[23],  4[22  I2],  7[22],  8[2  I2], 

3[2],  3[14],  2[12]. 

Hence  the  types  of  irreducible  concomitants  up  to  degree  4,  of  a  spinor 
of  type  [f  •&  •£  $],  are  as  follows: 


Degree  1;     [$ 

Degree  2;     [22],  [2  I2],  [I3  -  1],  [2],  [0]. 

Degree  3;     [f  f  J  -fl,  [f  I  f  -  «,  [f  f  I  -»],  3[f  H  «, 

211  *  i  -«,  2[f  |  *  -«,  2[|  *  HLU  H  -*]- 

Degree  4;     [4  2  1  -  1],  [3  3  1  -  1],  [2  2  2  -  2],  [32],  3[4  2],  [4  I2], 
6[3  2  1],  5[3  1  1  -  1],  3[2  2  1  -  1],  [4],  4[3  I3], 
4[3  1],  7[22],  8[2  I2],  3[1  1  1  -  1],  3[2], 


The  basic  spin  representation  of  the  orthogonal  group  in  7  variables  is 
of  degree  8.  The  group  of  the  spin  transformations  is  thus  a  sub-group  of  the 
corresponding  group  for  8  variables. 

The  rotation  group  in  7  variables  can  be  obtained  from  that  in  8  variables 
by  keeping  a  vector  of  type  [1]  invariant.  To  the  type  [1]  corresponds  the 
spinor  type  [(£)*]. 

Hence  the  method  gives  a  representation  of  the  7  -variable  rotation  group 
as  that  sub-group  of  the  8  -variable  rotation  group  which  is  obtained  by 
keeping  a  spinor  of  type  [(J)4]  invariant. 

The  existence  of  this  sub-group  is  of  some  interest,  but  otherwise  the 
isomorphism  would  seem  to  have  little  practical  significance  from  the 
point  of  view  of  invariant  theory. 
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SOME  PROPERTIES  OF  A  CERTAIN  DOUBLE  SURFACE 
IN  SPACE  OF  FOUR  DIMENSIONS 

By  R.  R.  JONES 
[Received  10  December  1945 — Read  13  December  1945] 

The  figure  of  six  general  points  in  four-space  has  received  recent  attention 
in  a  paper  by  Bronowski  ( 1 ).  There  he  discusses  the  system  of  quartic  curves 
through  the  six  points,  and  studies  in  particular  the  subsystem  formed  by 
those  curves  which  further  meet  a  plane;  hence  the  configuration  of  sets  of 
three  associated  quartics  is  determined.  We  are  here  concerned  in  detail 
with  the  manifold  filled  by  the  same  subsystem  and  in  particular  with  the 
surface  which  is  doubly  contained  by  that  threefold. 

1 .   The,  primal  V  in  [4] 

We  maintain  the  notation,  used  in  (1),  that  (q)  denotes  the  triple  infinity 
of  normal  rational  curves  of  order  four  in  four-space  which  pass  through 
six  points  (At)  (i  =  1,  2, ...,  6),  fixed  in  general  position. 

Those  curves  of  (q)  which  meet  a  general  line,  I  say,  generate  a  surface, 
F9  as  we  have  called  it  in  (2).  That  F9  is  of  order  nine  was  proved  there  by 
showing  that  the  plane  representation  of  prime  sections  of  F9  is  by  quintic 
curves  through  (i)  the  fifteen  points  of  intersection  of  pairs  of  six  fixed 
lines,  representing  (A^  which  are  fourfold  points  on  the  surface,  and 
through  (ii)  the  point  L  say,  corresponding  to  I. 

When  the  six  lines  are  each  tangent  to  a  conic  S,  F9  becomes  a  specialized 
surface  F°L.  We  have  shown  that  F9  and  F^  are  the  general  surfaces  of  their 
type  and  are  generated  each  by  oo1  curves  of  (q),  represented  by  the  pencil 
of  lines  through  L.  We  now  indicate  the  proof,  used  in  (2),  that  F9L  contains 
doubly  that  curve  of  (q)  to  which  I  is  chordal.  To  show  this,  we  employed 
a  special  case  of  a  theorem  which  is  in  fact  the  key  result  of  (2),  namely: 
in  space  of  s  dimensions,  the  primals  of  order  n  -f  1  which  pass  through  the 
secunda  of  intersection  of  n  +  2  primes  osculating  a  given  normal  rational 
curve  of  order  8  and  through  two  of  the  vertices  of  an  independent  simplex 


1945]  PROPERTIES  OF  A  CERTAIN  DOUBLE  SURFACE  381 

osculating  this  curve,  necessarily  pass  through  the  remaining  vertices  of 
the  simplex.  By  taking  n  =  4  and  s  =  2  in  this  result,  we  found  that  there 
is  an  involution  of  pairs  of  points  set  up  on  the  tangents  from  L  to  the  conic 
S  by  the  curves  of  the  system  representing  prime  sections  of  F*L.  Accordingly, 
we  deduced  that  each  of  the  tangents  from  L  to  £  represents  a  curve  which 
has  I  as  chord  and  is  double  on  F9L. 

The  double  infinity  of  those  curves  of  (q)  which  in  addition  to  passing 
through  the  points  (At)  meet  a  given  arbitrary  plane,  p  say,  generate  a 
threefold,  which  we  have  called  V.  To  find  the  order  of  V,  we  obtain  the 
number  of  points  in  which  it  is  met  by  a  general  line,  say  I  as  above.  But 
those  curves  of  (q)  which  meet  I  generate  the  novenic  surface  F9,  and  so 
nine  of  these  curves  meet  the  plane  p.  That  is,  the  generators  of  V  meet  I 
in  nine  points  so  that  V  is  of  order  nine. 

The  three-space  representation  of  prime  sections  of  V  is  by  quintic  sur- 
faces,* (F)  say,  through  a  point  P  and  through  (Ati)  (i,j=  1,2,  ...,6, 
i  =f=J,  A^  ==  A^  the  fifteen  lines  of  intersection  of  six  planes  (a^  (i  —  1, 2, . . .,  6) 
which  necessarily  osculate  a  twisted  cubic  curve,  C  say. 

We  now  list,  for  later  use,  some  properties  of  V,  readily  obtainable  from 
the  representation :  in  each  case  the  three-space  configuration  occurs  first : 
six  planes  al5a2,  ...,«6->six  sixfold  points  Al9AZi  ...,^L6; 
fifteen  lines  (A^),  simple  on  (jP)->  fifteen  lines  (At  A ^)  which  are  triple 

on  V\ 

twenty  points  such  as  N^kt  the  intersection  of  the  lines  A3-k,  Aki,  Aip 
are  double  points  on  the  surfaces  (F),  so  that  the  tangent  (quadric) 
cones  at  Niik  to  two  general  members  of  (F)  have  only  one  free  inter- 
section there ->  twenty  planes  such  as  A^A^A^  and  these  are  simple; 
genus  of  nO10,  the  decimic  curve  of  intersection  of  two  general  members 

of  (F),  is  eleven ->  section-genus  of  V  is  eleven; 
cubic  O~>a  curve  <715  of  order  fifteen,  which  has  six  triple  points  at  (At) 

at  each  of  which  the  curve  has  a  unique  tangent; 
base  point  P->  plane  p  and  this  is  simple  on  V; 

double  infinity  of  lines  through  P->  oo2  curves  of  (q)  which  generate  V\  and 
each  of  the  three  lines  of  intersection  of  pairs  of  osculating  planes  from 
P  to  O-»the  unique  curve  of  (q),  say  g1?  to  which  p  is  trisecant  and 
which  is  therefore  triple  on  V. 

There  is  an  involution  connecting  by  pairs  the  oo2  lines  through  P :  these 
pairs  represent  those  curves  of  (q)  which  form  with  q1  sets  of  "associated" 


*  This  is  seen  from  (2),  169,  by  taking  n  =  4  and  a  =  3  in  the  generalized  result. 
Conversely,  it  can  be  shown  by  counting  the  linear  conditions  involved  in  [3]  and  [4] 
that  V  is  the  general  threefold  of  its  type. 
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curves.  We  shall  see  that  these  pairs  of  quartics  are  always  represented  by 
two  lines  through  P  even  when  the  quartics  coincide  and  become  double 
curves  of  V,  meeting  the  plane  p  in  two  points. 

In  summary:  the  oo2  normal  rational  quartic  curves  which  pass  through  six 
fixed  points  in  [4]  and  which  meet  a  given  plane  generate  a  threefold  of  order 
nine,  on  which  the  base  points  are  sixfold,  the  lines  joining  pairs  of  these  points 
are  triple,  and  each  plane  determined  by  three  of  the  base  points  is  simple,  as 
is  the  given  plane ;  this  plane  is  trisecant  to  a  unique  generating  curve,  which  is 
triple  on  the  primal. 

2.  Some  surfaces  lying  on  V 

A  general  plane  of  [3]  meets  the  curve  ltClQ  in  ten  points  and  so 
represents  a  surface  of  order  ten  which  lies  on  V  and  passes  through  each 
sixfold  point.  If  the  plane  also  passes  through  P  the  corresponding  surface 
is  of  order  nine:  every  surface  of  this  type  on  F  meets  the  plane  p  in  a  line. 
Further,  each  such  surface  is  filled  by  the  oo1  curves  of  (q)  which  meet  that 
line  in  p:  these  are  represented  by  the  pencil  of  lines  through  P  in  the 
mapping  plane.  Hence  the  surface  is  of  the  type  F9  of  §  1,  but  not  in  general 
of  the  type  F\  since  the  traces  of  (at)  on  the  plane  do  not  necessarily  all 
touch  the  same  conic. 

When  the  plane  through  P  contains,  for  example,  the  line  A56  the  corre- 
sponding surface  is  the  projection  of  either  an  F9  or  an  FgL  from  one  of  the 
multiple  points.  Such  projected  surface  we  call  F5L:  in  our  notation  F6 
is  necessarily  F^. 

By  considering  the  (five)  traces  on  P-456  of  the  osculating  planes  (a^  in 
[3]  we  see  that  F*L  may  be  regarded  as  the  projection  from  one  of  its  general 
points  of  a  special  Bordiga  sextic  surface,  designated  F*L  by  Room  (3,  395). 
FL  is  part  of  the  section  of  V  by  the  prime  A1A2A3A^  say  TT.  For,  in  [3] 
the  four  base  planes  alt  a2,  a3,  «4  form  with  PA5Q  a  degenerate  surface  of  the 
system  (F).  Thus  the  section  of  V  by  TT,  residual  to  the  four  (simple)  planes 
joining  triads  of  the  "vertices"  of  TT,  is  FbL.  This  surface  accordingly  has 
five  triple  points,  four  at  Alf  A2,  A3,  AA  and  the  fifth  at  the  intersection  of 
n  with  the  (triple)  line  A5AQ. 

The  plane  representation  on  PA5Q  shows  in  addition  that  F5L  is  generated 
by  oo1  twisted  cubic  curves  meeting  a  directrix  line,  namely,  that  line  in  p 
which  corresponds  to  the  neighbourhood  of  P  in  P-456.  One  of  these  cubics 
is  double  on  JFJL,  being  chordal  to  the  directrix  line,  and  is  represented  by 
either  of  the  tangents  from  P  to  the  unique  conic  enveloped  by  the  traces 
of  (at)  on  PA56  (for  these  and  further  details  see  2,  154-155). 

To  summarize:  every  general  plane  through  the  point  P  in  [3]  represents 
on  V  in  [4]  a  surface  Fg  which  is  of  order  nine,  has  fourfold  points  at  the  sixfold 
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points  on  V,  and  is  filled  by  those  generating  curves  of  V  which  meet  a  line  lying 
in  the  plane  p.  Each  of  the  fifteen  planes  through  P  which  contain  one  of  the 
lines  (A^)  represents  on  V  a  surface  F*L.  In  general  these  surfaces  F9  and 
F*L  meet  the  curve  <715  on  V  in  three  distinct  points. 

3.  The  double  surface  on  V 

In  particular,  the  three  planes  which  can  be  drawn  through  P  to 
osculate  C  represent  surfaces  F9  which  are  also  F9L  and  have  three-point 
contact  with  <715.  For,  each  of  these  planes  is  met  by  the  six  given  planes 
(a^  in  lines  which  are  tangent  to  the  same  conic.  Each  of  the  three  planes 
therefore  meets  the  surfaces  (F)  in  quintic  curves  which  pass  through  P 
and  the  fifteen  points  of  intersection  of  six  tangents  to  a  conic.  In  fact,  there 
is  on  each  of  the  three  osculating  planes  from  P  the  representation  of  an 
^/>as  given  in  §1. 

But  there  are  in  all  oo1  planes  which  pass  through  P  and  meet  (a^  in 
tangents  to  a  conic.  For,  consider  the  dual  of  the  well-known  result  that: 
the  locus  of  the  vertices  of  the  cones  in  the  oo3  system  of  quadric  surfaces 
which  contain  six  general  points  in  three-space  is  Weddle's  quartic  surface. 
A  general  plane  accordingly  meets  the  locus  in  a  quartic  curve,  each  of  the 
oo1  points  on  the  curve  being  a  vertex  of  a  quadric  cone  through  the  six 
points. 

Dually,  therefore,  the  envelope  of  the  planes  meeting  six  given  planes  in 
tangents  to  a  conic  is  the  surface  dual  to  Weddle's  surface.  There  are  oo1 
tangent  planes  from  a  general  point  to  this  dual-surface,  that  is,  there  are 
oo1  planes  through  P  which  meet  (aj  in  tangents  to  a  conic  and  which 
accordingly  represent  surfaces  F9L  on  V. 

Let  w  be  any  general  plane  of  this  system.  Then  there  are  two  lines  in  w 
drawn  from  P  tangent  to  the  conic  which  is  touched  by  the  six  lines  of 
intersection  of  w  with  (at).  We  already  know  that  these  two  tangents 
represent  the  same  quartic  curve,  namely  the  double  curve  of  the  F*L 
corresponding  to  w.  Thus,  as  w  varies,  the  oo1  pairs  of  such  tangent  lines 
from  P  generate  a  conical  surface  which  is  the  map  of  a  double  surface, 
S  say,  on  K  S  is  generated  by  those  curves  of  (q)  which  meet  p  twice. 

The  order  of  S  is  the  number  of  points  in  which  it  meets  a  general  plane  t 
of  the  four-space.  But  t  meets  F  in  a  curve  of  order  nine  whose  genus  we 
have  noted  is  eleven.  Hence  this  curve  has  Jx8x7  —  11  =  17  double  points. 
These  lie  at  the  meets  of  t  with  the  double  surface  and  there  only  since  there 
is  no  other  double  surface  on  V.  Thus  S  is  of  order  seventeen. 

That  is,  the  single  infinity  of  curves  ofl(q)  which  meet  twice  the  plane  p 
generate  a  surface  S  which  is  double  on  the  threefold  V  and  is  of  order  seventeen. 


384  R.  R.  JONES  [13  Dec. 

The  definition  of  w  as  a  tangent  plane  to  the  surface  dual  to  Weddle's 
surface  has  an  interesting  interpretation.  Since  a  general  line  in  [3]  meets 
the  Weddle  surface  in  four  points,  there  are  dually  four  planes  which  pass 
through  a  general  line  and  touch  the  dual-surface.  In  particular  there  are 
four  planes  (w)  through  a  line  r  which  passes  through  P.  As  we  have  seen, 
there  correspond  to  these  planes  four  surfaces  of  the  type  F9L  on  V  whose 
directrix  lines  lie  in  p.  Further,  these  four  lines  all  pass  through  the  point  of 
intersection  of  p  with  that  curve  of  (q)  which  is  mapped  by  r  in  [3]  and  which 
is  common  to  all  four  surfaces  F*L.  In  other  words,  if  c  is  the  curve  in  which 
p  meets  S,  there  pass,  through  any  general  point  of  p,  four  joins  of  point- 
pairs  of  the  involution,  y\  say,  formed  on  c  by  the  pairs  of  points  in  which  p 
meets  the  quartics  generating  S.  The  genus  of  the  y\  is  seen  to  be  three,  by 
correspondence  with  the  genus  of  a  plane  section  of  the  Weddle  surface. 

Since  a  general  point  of  p  determines  a  unique  curve  of  (q)  we  may  restate 
the  above  result  thus:  through  a  given  general  member  of  the  curves  generating 
V  there  pass  oo1  surfaces  of  the  type  F9  lying  in  V  of  which  four  are  FgL ;  the 
double  curves  of  these  F*L  are  generating  curves  of  the  double  surface  S. 

4.  A  prime  section  of  the  double  surface 

We  now  examine  the  intersection  of  the  double  surface  with  the  fixed 
prime  TT,  that  is,  A1A2A3A4:.  The  curves  which  form  this  section  are  com- 
ponents of  composite  curves  of  (q)  which  meet  the  plane  p  twice,  since  no 
curve  of  (q)  can  meet  A1A2A9A4[  again  without  having  part  of  itself  in  that 
solid. 

(a)  The  line  A^A^  is  one  such  component:  for  if  A1A2  meets  the  prime 
A3A^A5A^  in  the  point  J312,  a  unique  cubic  curve  can  be  drawn  to  pass 
through  ^43,  A^  A5,  A^  to  meet  A^A^  and  to  meet  p  twice,  namely,  that 
cubic  which  passes  through  AB,  A^  A5,  A6,  BI2  and  has  as  chord  the  line  in 
which  p  meets  the  solid  A3A^A5AK.  (The  cubic  is  in  fact  the  double  curve 
of  an  F£  and  is  represented  by  each  of  a  pair  of  lines  through  P  meeting 
Alz:  see  §  2.)  Thus,  this  cubic  and  the  line  A^A2  form  a  reducible  curve  of  (q), 
i.e.  A^AZ  lies  on  S.  Similarly,  the  five  other  joins  of  the  "vertices"  of  n 
are  part  of  its  section  of  S. 

(b)  Let  the  plane  A^A2AB  meet  p  in  the  point  5123  and  the  plane  A±A5A6 
in  the  point  £>123==D456;  and  similarly  for  other  incidences.   The  conic 
defined  by  A^,  A2,  A^,  B123,  Z)123  lies  on  the  required  intersection:  for,  a 
unique  conic  can  be  drawn  through  A±,  A5,  A$,  B^,  Dl2a,  and  the  two  conies 
form  a  composite  member  of  the  rational  quartic  curves  which  generate  8. 
That  is,  the  conic  A^A^A^B^^D^^  lies  on  the  section  of  S  by  TT,  as  do  the 
remaining  three  similarly  defined  conies. 
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(c)  One  twisted  cubic  curve  can  be  drawn  to  pass  through  Alt  A2,  ABt  A^ 
to  meet  ABA6  at  B5$  and  to  meet  p  twice,  namely,  that  cubic  which  passes 
through  those  five  points  and  has  as  chord  the  line  in  which  the  prime 
meets  p.  Residual  to  A5A6,  this  cubic  makes  up  a  (reducible)  curve  of  (q) 
meeting  p  twice,  and  so  lies  on  S. 

The  above  details  enable  us,  in  order  to  verify  and  extend  our  knowledge 
of  the  double  surface,  to  apply  the  special  Cremona  transformation  known 
as  the  reciprocal  inversion  transformation. 


5.  An  application  of  the  reciprocal  inversion  transformation 

In  §  1  we  considered  the  quartic  curves  (q)  which  pass  through  the 
six  general  points  (At)  and  meet  a  fixed  plane  p.  We  now  consider  the  homa- 
loidal  system  of  primals  which,  based  on  the  points  (A^)  (i  =  1,  2,  ...,5)  as 
simplex  of  reference,  birationally  transforms  our  [4]  into  another  four-space, 
say  [4]'.  This  transformation  is  defined  by  the  appropriate  equations 
x't  =  k/Xf  for  the  same  range  i  —  1,2,. ..,5,  k  being  constant  and  any  one 
of  the  sixteen  self-corresponding  points  being  taken  as  unit  point. 

Next  we  list  those  details  (see  Room,  3,  180)  which  are  of  immediate 
application: 

the  base  point  A^,  for  example ->j5253J54j55,  the  prime  of  the  base 
simplex  in  [4]'  with  vertices  at  (B^  (i  =  1,  2, ...,  5); 

the  curves  (#)-> lines  through  J56,  say,  which  corresponds  to  A6] 

a  general  plane ->  a  special  Bordiga  sextic  surface  FQL\ 

qlt  the  unique  curve  of  (q)  to  which  the  plane  p  is  chordal-»the  unique 
trisecant  from  J56  to  the  corresponding  surface  F*L\ 

the  general  primes  ->  Liiroth  quartic  primals,  on  which  the  base  points 
,B15  #2, ...,  J35  are  triple,  the  lines  joining  pairs. of  these  points  are 
double,  and  the  planes  joining  triads  of  these  points  are  simple. 
Each  such  primal  is  represented  in  ordinary  space  by  quartic  surfaces 
through  the  ten  lines  of  intersection  of  five  planes.* 

In  [4]  we  have  already  seen  that  on  V,  the  double  surface  S  is  generated 
by  those  curves  of  (q)  which  meet  the  plane  p  twice.  The  lines  in  [4]'  which 
are  the  transforms  of  the  generating  curves  of  S  appear  as  the  chords  from 


*  We  note  from  the  representations,  that  .Fl[4]  projects  from  one  of  its  general 
points  into  F6L  and  from  one  of  its  triple  points  into  a  Liiroth  cubic  surface,  that  is, 
a  four-nodal  cubic  surface;  similarly,  V  and  the  Liiroth  quartic  threefold  are  the 
projections  in  five-space  of  a  threefold,  of  order  ten  with  six  sixfold  points,  from, 
respectively,  one  of  its  general  points  and  one  of  its  multiple  points. 
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JB6  to  the  surface  F*L.  The  order  of  the  conical  sheet  formed  by  these  chords 
from  B$  is  (see  Baker,  4,  242)  half  the  order  of  the  general  "chord  curve" 
on  F*L,  namely  |[(/*o~1)(/*o~2)"~2^]»  where  /*0  =  order  of  ^1  =  6  and 
p  =  section-genus  of  FQL  =  3.  Thus  the  required  order  is  seven. 

Further,  the  point  B6  is  the  vertex  of  the  cone  of  chords  and  accordingly 
A6  has  multiplicity  seven  on  IS.  Hence  each  of  the  base  points  (At]  on  8  is 
sevenfold. 

Consider  a  section  of  8  by  one  of  the  Liiroth  primals  of  the  reverse  trans- 
formation, say,  L\  corresponding  to  a  general  prime  through  B6.  If  the 
double  surface  is  of  order  h  and  has  each  join  At  A]  m-fold,  then  the  section 
consists  of  seven  curves  of  (q)  corresponding  to  the  seven  chords  through 
B6  in  the  prime,  together  with  (see  §  4)  the  ten  joins  of  pairs  of  (At)  excluding 
A$,  each  counted  2m  times,  and  the  ten  conies  such  as  that  through  AIf  A2,  Az 
(excluding  AQ),  and  through  the  two  points  in  which  the  plane  A^A^A^ 
meets  the  planes  A^A^A^  and  p.  Hence  the  curve  of  section  has  order 

4/z,  =  7  x  4  -f  10  x  2m  +10  x  2. 

Also  since  A1  is  sevenfold  on  S  but  triple  on  L\,  the  total  intersection  at 

A  j  is  of  order 
1 


From  these  equations,  h  —  17  and  m  —  1.  Thus,  8  is  of  order  seventeen  and 
contains  simply  the  lines  (A{Aj). 

It  is  likewise  easy  to  calculate  the  order  of  the  curve  c  which  S  traces  on  p. 
We  proceed  as  above. 

This  curve  is  the  locus  of  the  meets  of  p  with  those  curves  of  (q)  which 
meet  the  plane  twice.  Hence,  the  twenty  points  such  as  B123  in  which  p 
meets  the  plane  A^AzA^  (see  §  4)  are  on  this  locus  and  lie  by  sets  of  four  on 
the  fifteen  lines  in  which  primes  like  AlA2A^A4c  meet  p.  As  we  have  already 
shown,  such  a  line  has  only  two  other  points  on  the  locus,  namely,  the  two 
points  in  which  it  meets  the  unique  cubic  curve  which  passes  through 
Alt  A2,  A3,  A4,  meets  ASA6  at  B56  and  has  the  line  as  chord.  Thus  if  the  order 
of  c  is  a,  and  each  point  J5123  is  b  -fold  on  it,  we  have  the  equation 

a  =  46  +  2. 

In  the  reverse  transformation,  the  primal  L\  meets  c  in  seven  pairs  of  points, 
corresponding  to  the  intersections  of  a  general  prime  through  JB6  with  the 
conical  sheet  of  chords  from  BQ.  Also,  L\  contains  those  points  such  as 
^123  whose  suffix  does  not  contain  the  number  6.  Since  there  are  ten  such 
points,  we  have  the  equation 

4a  =  14+106. 
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These  equations  yield  a,  =  6,     6=1. 

Hence  c  is  of  order  six;  its  intersections  with  the  prime  TT  are  simple  points. 
From  the  mode  of  generation  of  c,  we  deduce  that  the  three  points  in  which 
qt  meets  p  are  double  on  c. 

The  above  transformation  has  thus  shown :  the  double  surface  S  is  of  order 
seventeen,  has  sevenfold  points  at  the  multiple  points  of  the  threefold  V,  and 
contains  simply  the  lines  joining  pairs  of  those  points ;  8  meets  the  plane  p  in 
a  sextic  curve  with  three  double  points. 

It  is  worthy  of  remark  that  the  recalculation  of  these  results  for  8,  using 
only  the  curves  (a),  (b),  (c)  of  §4,  shows  that  those  curves  are  simple  on  S. 
From  such  verification,  we  also  deduce  that  the  section  of  V  by  the  solid  TT 
is  not  exceptional,  though  n  is  in  a  special  position. 

6.  Other  properties  of  the  double  surface 

In  the  three-space  representation  of  V,  let  the  conical  surface  which 
maps  S  (see  §  3)  and  has  vertex  at  P,  be  of  order  x.  This  cone  meets  nO10, 
the  curve  corresponding  to  the  section  by  the  general  plane  t,  in  (i)  the 
seventeen  pairs  of  points  which  lie  on  the  seventeen  trisecants  through  P 
of  the  curve,  (ii)  the  5; -fold  vertex  P,  and  (iii)  the  twenty  points  (Nijk), 
since  in  [4]  the  planes  represented  by  those  nodes  intersect  S  in  curves 
which  are  components  of  simple,  reducible  members  of  (q).  Thus  we  have 
the  equation  10*  =  2  x  17  +  *  + 20. 

Whence  x  =  6,  so  that  the  double  surface  is  represented  by  a  sextic  cone. 
The  cone  has  three  double  generators,  namely,  the  meets  by  pairs  of  the 
planes  from  P  osculating  C. 

With  such  properties  of  8  available,  it  is  of  interest  to  find  the  primal  of 
lowest  order  in  [4]  which  will  meet  V  in  S.  Since  in  [3]  the  conical  sheet 
representing  S  is  a  sextic,  and  since  the  surface  representing  a  general 
prime  section  is  a  quintic,  we  begin  with  a  surface  of  order  thirty,  which  has 
P  sixfold  and  the  points  (N^k)  twelvefold,  and  contains  the  sextic  cone. 
Residual  to  this  cone,  the  surface  has  order  twenty-four  and  has  elevenfold 
points  at  (Nijk).  As  a  special  surface  of  this  type,  we  may  take  four  times  the 
system  of  planes  (%).  Such  a  surface  represents  the  section  of  V  in  [4]  by 
a  primal  of  order  six  which  has  fourfold  points  at  (A  i),  has  their  joins  double, 
and  contains  (once)  all  the  twenty  planes  represented  by  (A^&).  Hence  the 
primal  meets  V  in  a  (double)  surface  of  order  J(6  x  9  —  20)  =17.  We  note  also 
that,  since  there  are  ten  of  the  exceptional  planes  through  any  one  of  (A^^ 
these  points  have  multiplicity  on  the  double  surface  of  J(6  x  4—  10)  =  7; 
and  since  there  are  four  exceptional  planes  through  each  join  A^A^  (which 

Z2 
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is  double  on  the  sextic  primal  and  triple  on  F)  this  line  on  the  surface  has 
multiplicity  £(3  x  2  -  4)  =  1. 

Thus  our  conclusion  is:  on  the  threefold  V,  the  double  surface  S  is  cut  out 
by  a  sextic  primal  which  has  fourfold  points  at  the  six  sixfold  points  of  V. 

7.  "  Representation"  on  a  Segre  cubic  primal 

By  the  method  of  representing  curves  by  points,*  we  can  map  on  a 
Segre  ten-nodal  cubic  primal,  K  say,  the  threefold  V  and  the  surface  S. 

Let  s  be  a  fixed  solid  through  the  points  AI}  AZ)  A3  in  [4].  Each  curve  of 
(q)  meets  s  in  one  further  point  which  we  take  to  "represent"  that  curve. 
A  double  infinity  of  the  curves  (q)  lie  on  each  member  of  the  fourfold  infinity 
of  cubic  primals  which  have  nodes  at  the  six  points  (Af).  Thus  each  such 
cubic  is  represented,  in  this  sense,  by  the  trace  on  s  of  the  oo2  curves  of  (q) 
which  lie  on  that  primal.  These  intersections  form  a  cubic  surface  which 
has  nodes  at  Alt  A2,  A3  and  simple  base  points  at  the  (three)  meets  with  s 
of  the  lines  joining  pairs  of  A^  A5,  AB.  The  system  of  all  such  cubic  surfaces 
represents  the  prime  sections  in  another  four-space  say  [4]'  of  a  ten-nodal 
cubic  primal,  which  we  take  as  the  above  locus  K. 

We  have  seen  that  the  plane  p  is  met  by  a  double  infinity  of  curves  of  (q) 
forming  the  threefold  V.  These  curves  are  represented  in  [4]'  by  the  points 
of  a  rational  surface  which  has  a  triple  point,  Q^  say,  corresponding  to  the 
unique  curve  ql  which  is  trisecant  to  p.  Bronowski  has  shown  (1,  15)  that 
this  surface  is  traced  on  K  by  a  cubic  cone  which  has  vertex  at  Qlt  contains 
the  ten  nodes  of  K  and  touches  its  fifteen  planes.  Thus,  the  threefold  V, 
regarded  as  an  aggregate  of  quartic  curves,  is  represented  on  the  Segre  cubic 
primal  K  by  a  surface  of  order  nine  traced  on  K  by  a  cubic  cone  with  vertex  at 
the  point  representing  the  triple  curve  qr 

Similarly,  in  the  present  special  sense,  the  double  surface  S  is  mapped 
on  K  by  the  oo1  points  corresponding  to  the  curves  of  (q)  which  meet  the 
sextic  curve  c.  But  c  meets  in  eighteen  points  each  of  the  six-nodal  cubic 
primals  whose  intersections  with  s  represent  the  prime  sections  of  K.  Thus 
on  K,  S  is  mapped  by  a  curve  c18  of  order  eighteen,  which  naturally  lies  on 
the  novenic  surface  corresponding  to  p.  Also,  c18  lies  on  the  polar  quadric 
of  Qi  in  regard  to  K.  For,  from  §  9  of  (1 ),  we  know  that  on  K  three  collinear 
points  QI}  Q2,  Q3,  say,  represent  three  members  of  (q)  which  are  associated 
curves.  Consider  those  points  of  K  which  when  joined  to  Q±  are  coincident 
intersections  of  K  with  lines  from  Q±.  Such  points,  representing  curves  of 
(q)  which  meet  p  twice,  clearly  lie  on  the  polar  quadric  of  Qlt 

*  See  Babbage  (5);  also,  in  detail,  Bronowski  (1). 
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Thus,  the  double  surface  S,  regarded  as  an  aggregate  of  quartic  curves,  is 
represented  on  the  Segre  ten-nodal  cubic  primal  K  by  a  curve  of  order  eighteen^ 
traced  on  the  surface  representing  the  threefold  V  by  the  polar  quadric  of  the 
point  representing  the  triple  quartic  curve  q^ 
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ON  THE  AVERAGE  NUMBER  OF  REAL  ROOTS 
OF  A  RANDOM  ALGEBRAIC  EQUATION  (II) 
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1.  In  a  paper  under  the  same  title,  which  appeared  several  years  ago,* 
I  proved  that  the  average  number  of  real  roots  of  the  random  algebraic 
equation 


^  =  0, 
with  normally  distributed  independent  coefficients,  is  asymptotically 

2 

-logn.  (1) 

I  have  also  stated  that  the  same  conclusion  holds  if  the  X's  are  uniformly 
distributed  or  assume  only  the  values  1  and  —  1  with  equal  probabilities. 

Upon  a  closer  examination  it  turns  out  that  the  proof  which  I  had  in 
mind,  based  on  the  central  limit  theorem  of  the  calculus  of  probability,  is 
inapplicable  to  the  discrete  case,  i.e.  when  the  coefficients  assume  the 
values  1  and  —  1  only.  However,  the  proof  is  applicable  to  the  case  of 
uniformly  distributed  coefficients  and  to  a  rather  wide  class  of  continuous 
distributions.  This  situation  tends  to  emphasize  the  particular  interest  of 
the  discrete  case,  which  surprisingly  enough  turns  out  to  be  the  most  difficult. 

It  may  be  mentioned  that  Littlewood  and  Offord,t  who  were  the  first  to 
consider  the  problem  of  the  average  number  of  real  roots  of  algebraic 
equations,  in  all  of  the  three  cases  mentioned  here  (i.e.  normal,  uniformly 
distributed,  discrete),  have  found  them  to  be  essentially  equivalent.  It 
should  be  emphasized  that  our  method  yields  an  essential  refinement  of  the 
Littlewood-Offord  estimate  in  the  continuous  cases,  but  fails  to  yield  any 
significant  result  in  the  discrete  case.  In  fact,  so  far  as  the  discrete  case  is 
concerned,  the  Littlewood-Offord  estimate  is  the  only  available  one. 

*  Bull.  American  Math.  Soc.  49  (1943),  314-320.   Referred  to  later  as  paper  I. 
t  Proc.  Cambridge  PhiL  Soc.  35  (1939),  133-148. 
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Since  the  simplicity  with  which  the  result  was  obtained  in  the  case  of 
normally  distributed  coefficients  was  to  some  extent  misleading,  and  since 
the  method  proved  inapplicable  to  the  discrete  case,  it  seemed  worth  while 
to  show  how  the  proof  is  carried  out  in  the  case  of  uniformly  distributed 
coefficients.  Although  the  fundamental  idea  is  the  same  as  that  used  in  the 
case  of  normally  distributed  coefficients,  the  actual  execution  is  incom- 
parably more  tedious. 

2.    We  start  with  a  general  theorem  which  is  of  independent  interest. 

THEOREM  1.  If  f(x)  has  a  continuous  derivative  in  (a,b)  which  vanishes 
only  at  a  finite  number  of  points,  then  the  number  of  real  roots  off(x)  in  (a,  6)  is 
given  by  the  formula 


i  J-  1  (  £  cos  mx))  '  f(x]  '  dx]  ^* 


It  is  to  be  understood  that  multiple  roots  are  to  be  counted  only  once, 
and  that,  if  either  a  or  6  is  a  root,  it  is  to  be  counted  as  \. 

For  the  proof  of  this  theorem  (which  is  an  almost  immediate  consequence 
of  Dirichlet's  discontinuous  integral)  the  reader  is  referred  to  §  3  (pp.  612- 
613)  of  my  note  "On  the  distribution  of  values  of  trigonometric  sums  with 
linearly  independent  frequencies".* 

THEOREM  2.    Let 

X0+Xlx  +  X2x2+...+Xn_1xn-l  =  0  (3) 

be  an  algebraic  equation,  and  let 

XQy  Xlt  X2,  ...,  Xn_! 

be  independent  random  variables  each  having  o~(u)  as  its  distribution  function. 
Then  the  average  number  of  roots  of  (3)  falling  within  (a,  b)  is  given  by  the 
formula 


x  ^  (*n  />«**)  -  *  fn  p(**fi+  fa*-1*)  +  n  Wi-  fa 

y  u=o  u=o  fc*=o 

J  +  oo 
exp  (i£u)  dcr(u) 
-00 

is  the  characteristic  function  of  the  distribution  function  cr(u). 
*  American  J.  of  Math.  65  (1943),  609-615. 
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This  theorem  follows  from  Theorem  1  by  considerations  entirely  analogous 
to  those  given  in  §§  4  and  5  of  my  note.* 

If  we  assume  that  the  coefficients  of  (3)  are  uniformly  distributed  in 
(-1,1),  we  have  silU 

PW-—  • 

Thus  for  n  ^  2  the  order  of  the  integrations  can  be  changed  and  we  may  write 

)^,  (4) 


where 


(5) 

U=0  fc=0  /) 


_oo_oo 

Inasmuch  as  the  total  average  number  of  real  roots  is  easily  seen  to  be  four 
times  the  average  number  of  real  roots  falling  within  (0,  1),  it  is  sufficient 
to  investigate  Nn(Q,  I). 

Remark.  In  the  case  of  uniformly  distributed  coefficients,  we  have 

p(t)  =  e^\ 


and  <bn(x)  can  be  evaluated  explicitly.  The  resulting  expression  for  A^fO,  1) 
is  then  identical  (except  for  the  factor  4)  with  formula  (3)  of  my  paper  I. 

3.   Henceforth  we  shall  assume  the  coefficients  of  (3)  to  be  uniformly 
distributed  in  (  -  1  ,  1  ),  i.e.  sm  ^ 

/>(*)=—. 

However,  we  shall  use  p(t)  as  an  abbreviation  for  sin£/£.  This  will  simplify 
the  appearance  of  many  formulae  and  will  also  indicate  the  extent  to  which 
our  considerations  can  be  generalized.  The  principal  result  of  this  paper 

is  that  ....  i 

tfn(0,l)~- 

We  simplify  future  considerations  by  first  noting  that,  if  0  <  a  <  1,  we  have, 

(6) 


This  follows  immediately  by  observing  that,  as  T&-»OO,  0>n(x)  approaches 

*)  -  f  +c°  f  +c°i(  n  rt**0-i(  n 

J-oo  J-oo  TU=0  U-0 

*  I  take  this  opportunity  of  correcting  the  following  minor  errors  in  my  earlier 
paper:  In  the  third  formula  of  §4  replace  C  by  20.  In  §6,  exp{-w2/2)  and 
exp  (—  Ava/2)  should  be  replaced  by  exp  (-wa/4)  and  exp  (—  Avz/<4)»  and  the  final 

formula  should  read  ,,,,    /  \      ft   /i          /     »  9V  ^ 

E(b  <Jn)  =  2  ^/A  exp  (  —  6a). 

Also  on  p.  610,  line  16  from  top,  the  words  "is  the  set  ff"  should  be  replaced  by 
"is  contained  in  the  set  H". 
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and  that  the  convergence  is  uniform  for  0  ^  x  <  a.  Thus  (4)  implies  that 

limJVn(0,a)  =  -L  r®v(x)dx, 
»->oo  *n  Jo 

and  this  in  turn  implies  (6). 

This  simple  argument  shows  that,  "on  the  average",  the  real  roots  in 
(0,  1)  can  be  expected  to  cluster  about  1  and  that  it  is  therefore  sufficient 
to  prove  that 

^(a,l)~—  logrc. 

4.  We  introduce  the  following  abbreviations  which  will  be  used  in  the 
sequel: 

n  —  1  n—  1  Ti—1  T) 

»     _    V  T2fc         r     —    V  £2«.2(fc-D         D      —  -  n 

Gn  ~   2-i  *     >      ^n.  —    2j  *  ^  >      -^  i*  — 


/» 

-f 

J 


oon—  1 

n 

O    fc-0 


We  obviously  have  Fn(n)  — 

Introducing  in  (5)  the  new  variables 


we  obtain  easily 


that  is  «.(*,  =  Jf"  2g-(0)-gf-Q"(-^^.  (8) 

The  changes  of  the  order  of  the  integrations  are  justified  by  the  fact  that  the 
integrands  are  absolutely  integrable  for  n  ^  2.  In  what  follows  absolute 
constants  will  be  denoted  by  the  letter  7  with  subscripts. 

5.    By  elementary  computations  we  obtain 
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The  following  results  will  be  needed: 
(a)  If  0  <  a  <  1  and  e  >  0,  we  have 


(9) 

T    j      j  "V     n         ^J\^n\'^)f 

Indeed,  ..,......-.—  =  __ ~-t 

where      Pn(x)  =  — ±5 — —         il^n* ~   ^     -a;    a     • 

But  P?i(tf)<l+z2<2, 

j   u  V(Pn(^))          V2 

and  hence  *- „ —  <  —^ — . 

1—  x2        l—x 

Thus  (9)  follows. 

In  the  same  way  we  prove  that 

I                     "TTT"^  ~  0(1°8W)-  (^a) 

Noticing  that,  for  1 -— r-  ^  x2  ^  1 , 

we  have  T»~  ~  ^(w)> 


we  obtain 

/  VU-3/(n~l)}  > 

Moreover 


/;, 


=  I  ^r~-^/(l  —  I^)dx—  I 

J  a  V-"n  J  a 

e 
h 

/: 


and  since 


we  have 


fVU-3/(n-l)}    //7  /• 

^V(l-^)^< 

JV(l-n-e)         V^/i  J* 
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The  reasoning  of  §  5  of  paper  I  yields 

f  VU-3/(n-l)}     in 


f 
Jo 

f  VU-3/(n 

27i)< 

JV(l-n-~<) 


and  therefore 

f  VU-3/(n-l)}     in 


Consider  the  function      L(y)  =  yn~z  [n(l  —  y)  4- 2/]2. 

It  is  easy  to  verify  that  L(y)  is  an  increasing  function  for 

0<w<  1 , 

y  n-T 

and  that  therefore,  for  xz  <  I  —  - — - , 

we  have  L( 


(9c) 
(6)  If  1  —  n~e  <x2^lt  then  a  constant  yx  exists  such  that 

Bn>7ln*.  (10) 

This  is  a  trivial  consequence  of  the  definition  of  Bn. 

(c)  If  l-~<a;2<l  --  ?-, 

v  '  ne  n—l 

then,  for  sufficiently  large  n,        +    *  <  —  .  (11) 

V^n        ^ 

We  have  by  an  elementary  transformation 

_  1  -  a2"  +  x2  -  x2n~z  [n(  I  -  xz)  -f  a:2]2 
n  "  (l-^2)1  ' 

/jg  I  ~x2 

and  hence  -  7/^-n-  —    /(-,      ^  ,  ^  ,  (12) 


where 


Hence,  for  1 ;<#2<1 ^,  (13) 

ne  n—l 

we  have 
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Since  the  right-hand  side  of  this  inequality  approaches  the  positive  number 

I-16 
1      e*' 

as  n  ->oo,  we  see  that  for  sufficiently  large  n  and  x  satisfying  the  inequalities 
(13)  we  have  Qn(x)>0. 

This  when  combined  with  (12)  gives  (11). 

(d)  For  x  satisfying  the  inequality  (13)  and  n  sufficiently  large  we  have 

xk    y* 

<"  (       ' 


where  y2  and  y3  are  absolute  constants.  In  fact 

I        I    __  3 


It  is  easily  seen  that  for  0  ^  x  ^  1 

and  Jjl'rX^1- 

Since  we  have  seen  that  for  sufficiently  large  n,  Qn(x)  >  0  the  proof  of  (15) 

follows  with  ya  =  2. 

6.  We  shall  need  various  estimates  ofFn(7j)  and  Gn(rj)  for  different  ranges 
of  if)  and  x.  These  estimates  are  presented  in  the  form  of  nine  lemmas,  all 
of  which  are  reminiscent  of  the  proof  of  the  central  limit  theorem  of  the 
calculus  of  probabilities.  To  make  the  reading  easier  we  have  divided  the 
lemmas  into  groups,  presenting  each  group  in  a  separate  section. 

LEMMA  1. 

p+co    / u\\k        f+co      /   u*\       A/6 

lim  I         p  I  — rr  I     d/u  =  I       exp  ( 1  du  =  —, — . 

k— >ao  J  -co         \V    /  I  J  —  co  \         **  /  \7T 

This  result  is  well  known,  but  I  am  unable  to  supply  an  exact  reference. 
It  follows  from  Lemma  1  that  a  constant  y4  exists  such  that,  for 

wehave  f+«     (U\\* 

P(T^ 

J  —  oo         \'\K/  i 
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LEMMA  2.  //  J  <  x  ^  1  and  n  >  3,  then  a  constant  y5  exists  such  that 


I  <  TV 


Let  us  first  assume  that     i  <  x  <  1 . 

z  n—  I 

I 


Since 
we  have 


I  P(U)  I 


i-l, 


n 

oofc=0 


where  m(x)  =  [(1  —  x)~l]  and  [t]  denotes  the  greatest  integer  not  exceeding  t. 
By  Holder's  inequality  we  have 


r  +  oomte)        /   xk  kxk-l 

n  p  77r£+~7r 

J-coA-O         W^w  V°n 


and  since 


J-fo 
_. 


m(:rM-l 


we  see  that  the  right-hand  side  of  (17)  is  equal  to 

-  f  +  °°|  /)(%)  I^ 

'  J  -co 


But 

and  since 

we  see  that  a  constant  ye  >  0  exists  such  that  for  \  ^  x  <  1  we  have 


(18) 


Thus  the  right-hand  side  of  (17)  is  less  than 


du. 


Since  x  >  |,  we  have  m(x)  >  2  and  hence,  by  (16), 

j: 
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Also  v    n       = V2V  '^ < 

and  we  see  that  the  conclusion  of  Lemma  2  holds  with 


(19) 


If 


the  proof  proceeds  in  exactly  the  same  way,  except  that  instead  of  m(x) 
we  use  n—l.  Instead  of  (18)  we  have 

n-l       U/w  /          1\K»-1) 

>77>0, 


and  instead  of  (19)  ^L_2  <  i. 

It  may,  of  course,  be  assumed  that  y6  =  y7. 

7.    It  is  easily  seen  that  numbers  a,  (0  <  a  <  1)  and  i 
that  for  n>n0  and  a  ^  x  ^  1  we  have  simultaneously 


(n-l  \2 

S^-1    , 
i=l          I 


,  can  be  chosen  such 

(20) 

(21) 

We  can,  of  course,  assume  that  a  >  \. 

In  what  follows  a  will  always  have  the  meaning  just  described  and  it 
will  always  be  assumed  that  n>n0.  Although  we  make  the  assumption 
a  ^  x  ^  1  in  Lemmas  3,  4  and  5,  we  do  not  utilize  the  property  (20).  Only  in 
the  proof  of  Lemma  9  we  make  full  use  of  (20).  The  reason  for  insisting  on 
the  assumption  a  ^  x  ^  1 ,  even  in  lemmas  in  which  it  is  not  essential,  is  to 
establish  a  range  of  values  of  x  for  which  the  four  lemmas  quoted  hold 
simultaneously. 


LEMMA  3. 


and  |  T\  |  <  T\>a  we  have 


/•» 

J  l-4VJS 


(22) 


Since  |  p(u)\  ^  1  and  [*JBn~\ 


i  —  1  we  have 


n 


Using  the  stronger  inequality  |  p(u)  |  <  1/|  u  |,  we  have 
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399 


we  obtain 
and  therefore 


£+- 


>a*l* — - 


n 


We  have 


II  a;*  >  «B» 


and  since  a?  >  a  it  follows  that  #V(i-z2)  js  bounded  from  below  by  a  positive 
constant. 

Remembering  that  *JBn  >  5  we  see  that  the  conclusion  of  Lemma  3  holds. 


LEMMA  4.    // 


1 

- 


there  exists  a  number  N(Q,  e)  depending  on  Q  and  e,  such  that,  far  n  >  N(Q,  e), 
the  conclusion  of  Lemma  3  holds. 


we  have 


By  making  n  sufficiently  large  we  see  by  the  use  of  (11)  that  the  inequality 

\j  Bn  .,      oc 

V^      To 

can  be  made  to  hold.  The  rest  of  the  proof  is  identical  with  that  of  Lemma  3. 
LEMMA  5.  For  a  < x  ^  1 ,  |  y  \  <  T^  and  0  ^  £  ^  l-4^/J5w,  we  have 

RP\~7w£  +  ~7r~y}  <  exP  {  ~  °°^  ' "  (£2  +  ZRnfy  +  ^2)}>          (23) 
where 


Rn  — 


From  the  assumptions  of  the  lemma  it  follows  that 
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Since  |pM|  =  -  i     t  •  =  1  — -^cos#,     0<^<|w|, 

we  have  for  |  u  \  <  1*5 

u2        ,  _  f     cos  1-5 


U*          ,    *  f 

-  — cosl-5<exp| 


Thus 

n-l 


l_£°l.L^MvV-^-^    ^fc~1  \21 


and  the  conclusion  of  Lemma  5  follows,  in  view  of  the  definitions  of  Bn 
and  Cn  in  §  4.  It  follows  at  once  from  Schwarz's  inequality  that  Rn^l. 

LEMMA  6.  There  exists  an  integer  N(P,  Q,  e)  such  that,  for  n  >  N(P,  Q,  e)  and 

1-4<*2<I-;dh'  M<«>  p>0' 

we  have  P< 


|fi-4V«.n--i    /  xk         fak-i   \  rco  /     cosl- 

Dp  bir£  +  --/7r-?r£  <         exP  --  « 

UP  fc=0     WAt  VC7i     /  JP-Q          (  b 


The  use  of  (10)  and  (15)  shows  that  a  number  N(P)  Q,  e)  can  be  chosen  so 
that,  for  n  >  N(P,  Q,  e),  we  have  simultaneously 


and  hence  -^-  |T?|<~. 

Thus  the  reasoning  by  means  of  which  (23)  was  established  is  applicable, 
and  (23)  is  seen  to  hold. 
Hence  we  have  a  fortiori 


l-4:VBnn-l     I   rfk  J»~k-l    \  /*co  (      on^l- 

W-^-g  +  ^U^W  <f    exp    -^ 

P  fc=0     W^w  VCw      /  J-P  I  b 


Now 


f00  (COS  1-6  -J,,.        f00  ( 

exp  —  —  -  g2Ug<  exp 

JP+R»1  \       b  )  JP-Q  I 

The  first  and  last  steps  are  justified  by  the  inequality 


which  is  an  immediate  consequence  of  Schwarz's  inequality. 
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8.   LEMMA  7.  //  |  u  \  <  J,  we  have 


In  fact,  for  |  u  \  <  \TI  we  have 

,   .       sin  u      sin  \ 


401 


_ 


Thus 

,  . 

and  l 

On  the  other  hand,  if 

we  see  that 
and  hence  that 


0  <z< 


3—  /5 
—    — 


-  - 


+  z  +  z«  <  exp  (z  +  2*). 


Thus 

and 

If\u\<fc  we  see  that 

_    .        _  /       u*\          uz      \u 

and  therefore        log(l  -  -)  >  -~-^- 

This  completes  the  proof  of  Lemma  7. 


logy---<z  +  z* 

log  (  1  —  z)  >  —z  —  z*. 

—  <  —  ~-  , 


^__ 


LEMMA  8.  There  exists  a  number  M(P,  Q,  e)  such  that,  for  n  >  M(P,  Q,  e)  and 


1.  (25) 

The  number  M (P,  Q,  e)  is  chosen  in  such  a  way  that,  for  n  >  M(p,  Q,  e)f 
the  inequalities  (14)  and  (15)  hold  and  also 

T2     r>    ,      7s    r\    ,  1 
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Thus,  for  n  >  M(P,  Q,  e),  we  have 

ryk  7*~k-l 

Js  I     >*   I  n"*/  I  I  1 

-77rl£  +~7r~  ?   <i 

\-°n  VCn 

and  an  application  of  Lemma  7  together  with  the  definitions  of  Bn,  Cn,  Dn, 
and  Rn  yields 

a  \       n-l 

-fi 


where 
Thus 


We  now  observe  that  (14)  and  (15)  imply  that 


and  this  completes  the  proof  of  Lemma  8. 

9.    LEMMA  9.    A  constant  y10  exists  such  that,  for  a<#^l,  n>nQ  and 
\y\<  ro  a>  we  have 

--  (26) 


From  Taylor's  theorem  it  follows  immediately  that 


where  ^  is  a  number  between  —  ^  and  0  and  #2  a  number  between  0  and  ?). 
It  is  enough  to  prove  then  that  with  the  conditions  of  the  lemma  we  have 


From  the  definition  of  On(n)  given  in  §  4  we  have 


J  o        \V      n  V    n 


TTIT 

n 


where,  in  the  double  sum,  j^pl  and  the  prime  and  double  prime  on  the  pro- 
duct signs  indicate  respectively  the  omission  of  one  term  and  two  terms. 
We  first  note  that  a  constant  yn  exists  such  that 
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Thus 


The  reasoning  of  Lemma  2  is  applicable  since  an  omission  of  one  term  from 
the  product  has  almost  no  bearing  on  the  proof.  We  see  then  that  a  constant 
y12  exists  such  that 


(27) 


This  fact,  together  with  the  definition  of  (?„,  implies  immediately  that 


n  —  l  j  2^.20'— 1)  /»  oo 
7=0        Cn       Jo 

The  estimate  of  the  double  sum  is  considerably  more  involved.  We  write 

/•oo  fi-4VB«     r°° 

//p'ir<zg  =  + 

Jo  Jo  Ji- 

Since  |  p'(u)  \  ^  2,  we  have 

I  p'p'WdE, 

We  may,  of  course,  assume  that  n  ^  6  so  that  <JBn  ^^Jn<n  —  3  and  we  obtain 
(since  \p(u)  |  <  1) 

/•oo  /»oo 

Jl-4VB   'n'l<i£<Jl^B    tljs 


The  estimate  of  the  right-hand  side  proceeds  now  exactly  as  in  the  proof  of 
Lemma  3,  except  that  the  integral 


and  the  product 


n 


encountered  in  proving  Lemma  3  may  have  to  be  replaced  by  the  integral 

£  1  \-CVBnl4-2 


J 


and  the  product  U"  xk 

respectively. 

Since  for  x  >  a  and  n  >  nQ  we  have  \_^Bn]  ^  5,  the  final  estimate  is  of  the 
same  form  as  in  Lemma  3  except  that  y8  should  be  replaced  by  a  different 
constant  y13.  Thus 

(28) 


AA2 
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In  order  to  estimate  the  integral 

Jo 
we  first  notice  that,  since  0  <  £ <  l*±^Bn  and  |  y  |  < ^a  < ^,  we  have 

xk         kxk~~* 


It  is  easy  to  verify  that,  for  |  u  \  < 


and  we  can  write 


<rf4 

Thus  by  Lemma  5 


-£+7-17r 


and  therefore 


J  o  I 


Use  has  been  made  of  the  fact  that 

^  e^&-i 


jx*~* 


From  (28)  it  follows  that 


(29) 


n-l  \2 


(30) 
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In  deriving  (30)  use  has  been  made  of  the  trivial  fact  that  Cn  >  I  and  of 
formula  (20).  This  is  the  only  place  in  the  whole  proof  where  formula  (20) 
is  utilized  and  where  the  restriction  x  >  a  is  really  necessary. 
Combining  (27),  (29)  and  (30)  we  obtain 


and  the  conclusion  of  Lemma  9  follows  with  y10  =  y12  +  y15  4-  4y13. 

10.   We  are  now  ready  to  prove  the  principal  result  of  this  paper,  namely 
that  ___      l 

2n 

As  we  observed  in  §  3  it  will  suffice  to  prove  that 

I 

2zr 

/*oo  r>n    (f\ 

T  t»t  VF  (T\  — 

jueu  T«iic)  — 


so  that  N&1)  =      .      jL  *rn(x)  dx. 

We  now  establish  the  following  two  estimates: 
I.    For  a<x^l,  n>n0  we  have 


II.    For  !_It  <«.<!__!_  (32) 

and  e±  >  0  we  can  find  a  number  K(e,  ex)  depending  on  e  and  el  such  that, 
for  n  >  K(e,  e^)  and  x  satisfying  (32),  we  have 

|Tn(*)-7rV(l-^)|<ei.  (33) 

Proof  of  I.  We  write 


+  2f«   2Q>(o)-gny-gn(-y) 

Ja/10  ^  ' 


and  estimate  the  first  integral  by  the  use  of  Lemma  9  and  the  second  integral 
by  the  use  of  Lemma  2. 

We  obtain  |  ¥„(*)  I  <  2Tlo  ^  +  8y5  ^  , 

and  hence  I  holds  with  y16  =  iay10  -h  80y5/a. 
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Proof  of  II.  Let  0  <8  <  ^ 

*„(,)  =  2  p«q.(o)- 

Jo 
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By  Lemma  9  we  have  I  A  I  <  2<tyio> 

and  by  Lemma  2  |  /3  1  <  8#y5. 

To  estimate  the  integral  72  we  investigate  the  difference 


where  #„(?/)  =  f  °°  exp  {  -  $ 

Jo 

and  ?;  ranges  between  d  and  1  /£. 

It  foUows  from  Lemma  8  (with  P  = 
and  |  ?;  |  <  1  /(^  we  have 


)}  dg, 


(34) 
(35) 


(36) 


,  Q  =  I  /$)  that  for  n  >  M(2/d,  1  /<J,  e) 


f2l8n-\ 

n/ 

Jo    &-o 


-l     \  /»2/ 

-V)«- 

«    /         Jo 


Applying  Lemma  6  with  P  =  2/tJand^  =  l/£weseethatforn>  AT(2/£,  l/8,e) 
and  a;  in  the  range  (32)  we  have 


and 


I    /ml-4v/Sw»-l 

n 

\J21S  fc.O 


/»o 
Jl 


(38) 


Now  by  Lemma  4  with  Q  =1/3  we  have  for  w  >  N(1/S,  e),  a;  in  the  range  (32), 
and    TJ    <  l/d 


JMVB  *?o' 


TIP 


/oq\ 
(     } 


Finally,  for  |  ^  |  <  1/<J  and  regardless  of  x  and  ft,  we  have  the  inequality 

J  2/ 


2/o 
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If  now  n>MB,x{M(2/d,  l/£,e),  N(2jd9  l/tf,e),  N(l/8,€)}  =  v(d,e)  and  x  satis- 
fies (32),  by  combining  (37),  (38),  (39)  and  (40),  we  see  that,  for  |  y  \  <  1/3, 
we  have 


Thus  for  n  >  v(8,  e)  and  x  in  the  range  (32)  we  have 

2       J*  ^  ^ 

8f2/         (27yQ)        \  y1ft  ^  f00          (     cos  1-5, 


An  elementary  computation  shows  that 


It  is  also  easily  verified  that 


(42) 


uniformly  for  0  <  a;  <  1  (and  hence,  a  fortiori,  for  a;  in  the  range  (32)).  Com- 
bining (34),  (35),  (41)  and  (42),  we  see  that  for  x  in  the  range  (32)  and 
n  >  v(S,  e)  we  have 


Now,  with  e  and  el  given,  we  first  choose  Sfej)  (depending  on  ex)  in  such  a 
way  that  cos  1-5 


With  ^  =  ^(G!)  thus  chosen,  we  find  a  number  i>i(tf(ei),e)  depending  on  dfa) 
(and  hence  on  ex)  such  that,  for  w  >  ^(^Ci),  e),  we  have 


Putting  JT(€,  ex)  =  Max  {^(8(6-^,  e),  ^(^(e!),  e)}  we  complete  the  proof  of  (33). 
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The  proof  of  the  main  result  follows  now  very  easily.  In  fact,  we  have 

/V(i-n-«)    in 


and  hence,  by  (31)  and  (33)  and  for  n  >  K(ey  €}), 


^(         ^  ^  , 

V^/t  277  J  V{l-3/(w-l 


Making  use  of  (9).  (9  a),  (9fe)  and  (9c)  we  complete  the  proof  by  observing 
that  e  and  e{  are  quite  arbitrary. 
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THE  DlFFERENTIABLE  PARAMETRIZATION  OF  A  SURFACE 

By  A.  J.  WARD 
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Let  O  be  any  point  of  a  two-dimensional  manifold  in  three-dimensional 
Euclidean  space,  so  that  some  neighbourhood  S  of  0,  taken  closed,  is  the 
topological  image  of  a  closed  square.  This  neighbourhood  may  be  para- 
metrized in  an  infinite  number  of  ways,  the  coordinates  x,  y  and  z  being 
expressed  as  functions  flt  /2  and  /3  of  real  variables  u  and  v  in  the  range 
|  w  |  <  1,  |  v  |  <  1,  where  the  correspondence  between  (u,  v)  and  (x,y,z)  is 
one-one.  We  shall  say  that  the  parametrization  is  differentiable  at  O  if,  at 
the  corresponding  values  of  u  and  v,  the  f  unctions  fv/2  and/3  are  all  totally 
differentiable  and  the  Jacobians  d(fi,fj)ld(u,v)  are  not  all  zero.  It  is  clear 
that,  if  any  such  differentiable  parametrization  exists,  we  may,  by  simple 
transformations  of  the  variables,  suppose  that 

§&  =  ^  =  1       ^  =  ^  =  ^?  =  ^  =  0. 

du       dv         '      dv      du      du       dv 

The  problem  discussed  in  this  paper  is  that  of  finding  a  necessary  and 
sufficient  condition,  involving  8  only  as  a  set  of  points,  for  the  existence  of 
a  parametrization  differentiable  at  a  given  point  O.  It  is  clear  that  a  neces- 
sary condition  is  the  existence  of  a  tangent  plane;  that  is  a  plane  (which  we 
may  take  to  be  z  =  0,  O  being  the  origin)  such  that  z/V(#2  +  y2  +  z2)  ->  0 
uniformly  as  ^/(#a-f-^a-f-2a)->0,  for  (x,y,z)  lying  in  8.  This  condition,  how- 
ever, is  easily  seen  not  to  be  sufficient.*  A  further  condition  arises  from  the 
fact  that,  if  the  derivatives  are  normalized  as  above,  any  curve  in  S  near  O 
differs  but  little  from  its  pattern  curve  in  the  (u,  v)  plane,  or  rather  from  the 
congruent  curve  obtained  by  putting  x  =  u,  y  —  v,  z  =  0.  We  have,  in  fact 
(if  there  is  a  differentiable  parametrization),  the  following  result. 

There  exists  a  function  i/r(r),  which  ->0  as  r->  +  0,  with  the  following  pro- 
perty. Let  P  be  any  point  of  S  and  C  a  simple  closed  curve  on  S,  such  that  P 


*  Cf.  the  corresponding  case  of  a  curve.  A.  J.  Ward,  Fundamenta  Math.  28  (1937), 
280-288. 
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and  all  points  of  C  lie  within  r  of  O,  and  let  P,  G  be  the  orthogonal  projections 
of  P,  C  on  any  plane.  Then  there  exists  a  deformation  C'  o/C  (lying  in  the  same 
plane)  in  which  each  point  moves  by  at  most  pi/r(r),  where  p  is  the  distance  from 
O  of  the  corresponding  point  of  (7,  such  that  C'  does  or  does  not  surround  P 
according  as  C  does  or  does  not  surround  P  on  S.  (If  C  passes  through  O,  the 
plane  of  projection  must  not  be  perpendicular  to  OP.*) 

We  call  this  condition  A.  It  is  comparatively  easy  to  show  that  condition 
A,  together  with  the  existence  of  a  tangent  plane,  is  sufficient  for  the  exist- 
ence of  a  parametrization  differentiable  at  O.  The  main  result  of  this  paper, 
however,  is  that  condition  A  itself  involves  the  existence  of  a  tangent  plane, 
and  thus  is  by  itself  sufficient  for  a  differentiable  parametrization. 

Two  further  problems  naturally  suggest  themselves;  first,  to  find  con- 
ditions for  the  existence  of  a  parametrization  which  is  differentiable  at  all 
points  of  a  given  set,  and  secondly,  to  extend  the  result  to  manifolds  of 
higher  dimensions.  It  appears  unlikely  that  the  elementary  methods  used 
in  this  paper  would  be  adaptable  to  this  last  problem. 

1.  Notation  and  preliminaries.  We  suppose  0,  S  given  as  in  the  Intro- 
duction, satisfying  condition  A.  (x,y,z)  are  coordinates  in  the  space  in 
which  S  lies  and  (u,  v)  the  parameters  of  any  one  parametrization  of  S  by 
continuous  functions,  assumed  given. 

Italic  capitals  P,  C,  ...  denote  points  or  curves  on  S,  Clarendon  letters 
P,  C,  ...  their  projections  on  some  plane,  and  Greek  letters  such  as  H,  F,  ... 
the  corresponding  points  or  curves  in  the  (u,  v )  plane.  Curves,  surfaces  or 
volumes  with  no  immediate  relation  to  $  are  denoted  by  Roman  capitals. 
As  usual,  r,  3  and  e  denote  (small)  positive  numbers.  We  shall,  when  there  is 
no  risk  of  confusion,  use  the  symbol  ±  to  indicate  bounds  of  approximation; 
for  instance,  if  we  say  that  a  distance  is  r  ±  e  we  mean  that  it  lies  between 
r  —  e  and  r  +  e. 

We  shall  often  have  to  consider,  as  in  condition  A,  a  deformation  of  a 
projected  curve  C  in  which  each  point  moves  by  at  most  ep,  where  e  is  given 
and  p  is  the  distance  from  O  of  the  corresponding  point  on  S.  We  shall  call 
this  for  brevity  an  ep  deformation,  pointing  out  once  and  for  all  that  in  this 
notation  p  does  not  represent  a  constant. 

If  P,  Q  are  any  two  points  of  S  and  A ,  B  any  two  points  in  the  space,  we 
shall  say  that  P  and  Q  may  be  connected  within  S  of  AB  if  there  exists  in  S 
an  arc  joining  P  and  Q,  all  points  of  which  lie  within  d  of  the  straight  seg- 

*  Since  C  is  a  simple  closed  curve,  the  question  whether  C  surrounds  P  on  S  is  a 
purely  topological  one.  C  and  G'  need  not,  however,  be  simple.  Let  £,?/  be  rectangular 
coordinates  in  the  plane  of  projection,  with  P  as  origin.  We  say  that  C'  surrounds  P 
if  arg(£+t'7;)  changes  when  (£,77)  describes  G'  once. 
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ment  AB.  The  statement  that  P  and  Q  may  be  connected  within  $  of  AB 
produced  is  interpreted  similarly,  with  reference  to  the  infinite  straight  line 
through  A  and  B. 

We  shall  often  have  to  consider  the  part  of  S  cut  off  by  a  convex  volume 
V,  which  includes  a  point  P  of  S  but  no  point  of  the  boundary  of  S.  To 
simplify  topological  considerations  we  shall  always  suppose  the  (u,  v) 
plane  divided  into  extremely  small  squares.*  Those  squares  corresponding 
to  parts  of  8  lying  entirely  inside  V  will  form  one  or  more  closed  domains, 
one  of  which,  say  A,  will  include  II,  the  pattern  of  P.  The  boundary  of  this 
domain  A  will  consist  of  one  or  more  simple  closed  curves:  if  there  are  more 
than  one,  then  one,  which  we  shall  call  the  outer  boundary,  must  enclose  all 
the  others.f  The  corresponding  curve  on  8  we  shall  call  the  outer  boundary 
of  the,  component,  determined  by  P,  cut  off  from  S  by  V.  Strictly,  we  should 
specify  the  subdivision  of  the  (u,  v)  plane  into  squares,  but  it  is  clear  that  we 
can  make  it  fine  enough  to  ensure  that  all  points  of  this  outer  boundary 
(on  S)  lie  within  a  given  distance  $  of  the  boundary  of  V.  We  shall  then  denote 
this  outer  boundary  (of  the  component  determined  by  P)  by  B(P;  V,d)9 
and  the  part  of  S  interior  to  B(P;  V  ,d)  by  8(P;  V,£).  (We  remark  that 
$(P;  V,#)  need  not  lie  entirely  inside  V.)  As  a  particular  case,  if  V  is  a 
sphere  of  centre  P  and  radius  r,  which  we  shall  denote  by  K(P;  r),  we  shall 
write  B(P;  r,  d)  and  £(P;  r,  S)  for  B(P\  V,  d)  and  S(P;  V,  <f ). 

We  now  state  three  lemmas  which  follow  almost  immediately  from 
condition  A: 

LEMMA  1 .  If  P  is  inside  C  and  all  points  of  C  lie  within  r  of  O,  then  P  lies 
within  r[l  +  &(%?)]  of  0,  provided  that  fi(2r)  <  1. 

It  is  sufficient  to  prove  that  no  point  P  inside  C  lies  exactly  at  distance 
r[l  +  ^(2r)]  from  0.  For  this  we  have  only  to  project  C  on  to  a  plane  through 
0  and  P,  and  apply  condition  A. 

LEMMA  2.  Under  the  conditions  of  lemma  1,  if  P,  C  are  projections  on  to 
any  plane,  then  P  lies  inside  C  or  within  ri/r(2r)  of  G. 

This  follows  at  once,  since  OP  <  2r  by  lemma  1.  We  remark  that  we  may 
put  C  =  B(O]  r,  d),  and  P  any  point  of  8(0;  r,  d). 

LEMMA  3.  Let  P,  Q  be  any  two  points  of  8(0  \  r,er),  where  ft(2r)  <e<  J. 
Then  either  P,  Q  may  be  connected  within  3er  of  PQ  produced,  or  each  of  P 
and  Q  may  be  connected,  within  3er  of  PQ  produced,  with  a  point  P' ,  Q' 
respectively  whose  distance  from  O  is  r±  er. 

*  It  is  easily  proved  that  curves  and  domains  made  up  from  a  network  of  squares 
have  those  topological  properties  which  appear  intuitive. 

t  It  may  have  a  finite  number  of  points  in  common  with  any  of  the  other  parts  of 
the  boundary  of  A . 
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We  take  for  V  the  common  part  of  K(O;  r)  and  an  infinite  cylinder  H  of 
axis  PQ  and  radius  2er.  Suppose  first  that  Q  is  in  S(P\  V,  er),  so  that  P,  Q 
can  be  joined  by  an  arc  lying  entirely  in  this  domain.  The  projection  of 
B(P\  V,  er)  on  to  the  plane  through  P  perpendicular  to  PQ  lies  entirely 
within  2er  of  P.  Hence,  by  lemma  2,  all  points  of  the  arc  PQ  have 
projections  within  3er  of  P,  so  that  the  arc  itself  lies  within  3er  of  PQ 
produced. 

The  alternative  is  that  Q  is  outside  1?(P;  V,  er).  We  say  that  in  this  case 
there  is  a  point  P'  of  B(P\  V,  er)  at  distance  r  ±  er  from  O.  The  above  argu- 
ment will  then  apply  at  once  to  PP',  and,  by  symmetry,  also  to  Q  and  some 
point  Q'. 

If  no  such  point  P'  exists,  all  points  of  B(P\  V,er)  must  lie  within  r  —  er 
of  O  and  within  er  of  the  boundary  of  H.  The  projection  of  B(P\  V,  er)  on 
to  the  same  plane  as  before,  will  not  now  approach  within  er  of  P.  An 
ep  deformation  cannot  therefore  alter  its  inclusion  relation  with  P,  and 
since  P  is  inside  B  it  must  lie  inside  the  projection  of  B.  But  P  is  also  the 
projection  of  Q  lying  outside  B,  so  that  we  have  a  contradiction. 

2.  Our  fittst  task  is  to  show  that  the  part  of  8  within  r  of  O  lies  within  er, 
say,  of  some  plane,  for  all  sufficiently  small  r.  We  take  first  a  comparatively 
easy  case,  when  (to  speak  roughly)  S  fails  to  fill  completely  any  small  sphere 
round  O.  Later  we  shall  show  that  the  other  case  is  in  fact  impossible;  this 
occupies  §§  3  and  4. 

THEOREM  I.  Let  e  <  0-1  be  given,  and  r  such  that  ^r(2r)  <  10~3e2.  Suppose, 
that  there  exists  a  point  P  of  S(O;r,  er)  with  OP  ^  Jr,  and  a  line  L  through  P, 
such  that  P  cannot  be  connected  with  any  point  of  8  at  distance  r±er  from  O 
by  a  curve  within  3er  of  L.  Then  all  points  of  S(O;  r,  er)  lie  within  £er  of  some 
fixed  plane,  and  so  within  er  of  some  plane  through  O. 

Take  P  as  the  origin  and  L  as  the  z-axis.  We  remark  first  that  any  point 
Q  =  (#1?  ylt  0)  satisfying  ^(x\  +  y\ )  ^  er  must  be  the  projection  of  some  point 
(xi»  y\>  z\)  °f  &•  ^or>  ^aking  V  as  the  common  part  of  K(O,  r)  and  the  cylinder 
^(cc2-fy2)^2er,  we  see  by  the  data  that  B(P\  V,  er)  contains  no  point  at 
distance  r  ±  er  from  0,  and  hence  (as  in  lemma  3)  that  P,  and  so  Q,  must  be 
inside  the  projection  B.  But  B  can  be  continuously  shrunk  to  a  point  in 
$(P;  V,  er),  and  as  B  shrinks  to  a  point  it  must  at  some  stage  pass  through 
0-  We  remark  that  the  point  Q  so  obtained  may  be  connected  with  P 
within  3er  of  L. 

Consider  then  a  point  Q  —  (er,  0,  z±)  of  8.  Applying  lemma  3  with  e 
replaced  by  el  =  10~3e2,  we  see  that  P  may  be  connected,  within  3ejr  of 
PQ  produced,  either  with  Q  or  with  a  point  distant  r  ±  e±r  from  O.  In  either 
case,  writing  zx  =  ker,  we  see  that  there  exists  an  arc  PQ±  say  in  S,  which 
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lies  within  3e1r  of  the  segment  Lx  say,  namely,  |  x  \  ^  er,  y  —  0,  z  =  kx,  and 
whose  end-point  QL  satisfies  |  x  \  —  er.  By  a  similar  argument  there  exists 
an  arc  PR±  lying  within  Se^r  of  the  segment  L£,  namely,  x  =  0,  |  y  \  <er, 
z  =  k'y  say,  whose  end-point  El  satisfies  |  y  \  =  er.  For  simplicity  we  shall 
suppose  that  x(Q±)  =  y(R1)  —  +  er. 

Now  let  Vj  be  the  set  of  points,  distant  at  most  r  from  0,  which  lie  within 
Se^  of  the  plane  strip  #  ^  0,  ?/  ^  0,  x  +  y  —  er.  By  our  data  (since  Qe1  +  e<  3e), 
we  see  that  B(Q1;  V1,e1r)  cannot  contain  any  point  distant  r±er  from  O. 
Thus  B(Ql-J  V-^r)  lies  entirely  within  exr  of  the  "walls"  of  Vx  parallel  to 
the  z-axis  L.  Its  projection  B  on  the  plane  z  =  0  is  therefore  distant  at  least 
46!  r  from  the  segment  x^Q,  2/^0,  x  +  y  —  er,  and  so  at  least  exr  from  the 
segment  joining  the  projections  Ql5  Rt.  By  an  argument  like  that  of  lemma  3, 
B  must  therefore  surround  QI,  and  so  Rt;  whence  B  surrounds  both  Ql  and 
/?!,  so  that  Qi  and  R±  may  be  connected  in  S(Q1'J  Vj,  e-^r) ;  that  is  (by  lemma  2), 
by  an  arc  lying  within  6exr  of  the  plane  x  +  y  =  er. 

We  now  have  a  closed,  not  necessarily  simple,  curve  PQ1R1P.  This  will 
contain  a  simple  closed  curve  P2$27?2P2,  or  C  say,  where  P2QZ  lies  within 
3exr  of  LI}  C2-K2  within  6etr  of  the  plane  x  +  y  =  er,  and  -/?2P2  within  3e^r 
of  Lj.  Consider  now  two  triangular  prisms  V2,  V3;  namely,  x  ^  4e!r,  i/  ^  4e!r, 
a;  +  y ^ er  —  7 ^26! r ,  and  as ^ 5ex r ,  y^Se^r,  x  +  y^er  —  8 ^2e-^ r  respectively . 
By  projecting  on  to  the  plane  z  =  0  we  see  that  any  point  X  of  S,  lying  in 
V2  and  within  r  of  O,  is  inside  C.  Now  project  on  to  a  plane  perpendicular 
both  to  the  plane  x  +  y  —  0  and  also  to  the  plane,  H  say,  which  contains  the 
segments  Lx  and  LJ.  Any  point  of  C,  and  therefore  any  point  of  its  pro- 
jection, lies  within  3e±r  of  H  or  within  6exr  of  the  plane  x  +  y  =  er.  It  follows 
that  the  point  X,  whose  projection  must  be  inside  some  e±p  deformation 
of  the  projection  of  C,  must  lie  within  te^r  of  H. 

Consider  in  particular  the  point  T,  say,  of  S(P',  V,er),  for  which 
x  =  y  —  Jer,  and  form  B(T\  V^,e±r).  By  the  usual  argument,  we  see  that 
any  point  X  inside  or  on  this  curve  is  certainly  in  V2  and  within  r  of  O, 
and  hence  within  le^r  of  H;  we  note  also  that  any  point  of  B  is  distant  at 
least  Jer  —  Gejr  from  T7.  Now  let  X  be  any  point  of  S  outside  B  but  within 
2r  of  0.  Project  on  to  a  plane  through  T  perpendicular  to  XT.  Since  T  is 
inside  and  X  outside  B,  there  must  exist  one  e1p  deformation  of  the  pro- 
jection B  which  surrounds  T,  and  another  which  does  not;  hence  B  must 
pass  within  e^r  of  T.  A  point  of  B  within  exr  of  T  must  be  at  most  &evr 
from  H  and  at  least  Jer  —  le^r  from  the  corresponding  point  of  B.  It  easily 
follows  that  the  angle  a  between  XT  and  H  must  satisfy  the  relation 
( £e  —  le^)  sin  a  <  9ex.  The  distance  of  X  from  H  is  then  at  most  3r  sin  a  +  46J/, 
which  is  clearly  less  than  Jer  since  e^  —  10~3e2.  Thus  the  theorem  is 
proved. 
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3.  We  next  examine  the  consequences  of  supposing  that  the  conditions 
of  theorem  I  are  not  satisfied. 

THEOREM  II.  Suppose  that,  for  given  e  and  r,  S(O;  r,  er)  does  not  lie  entirely 
within  er  of  any  plane  through  O,  and  that  ^(4r)  <  e±  =  10~9e4.  Take  any  ortho- 
gonal axes  Ox,  Oy,  Oz.  Then,  for  any  N  such  that  2Ne  <  J,  there  exist 
2Narcs  C'i(x),  C'i(y),  i  =  1,  2,  ...,N,  on  S  such  that 

(i)  each  arc  runs  from  O  to  a  point  of  B(O\  r,  er),  no  two  arcs  meeting 
except  at  O  ; 

(ii)  an  arbitrarily  small  simple  closed  curve  can  be  drawn  on  S,  surrounding 
O,  to  meet  these  arcs  in  the  cyclic  order  C'[(x),  C'[(y},  Cz(x),  ...,  C'^(x),  C"N(y)  ; 

(iii)  Cl(x)  lies  entirely  within  (2N  -h  1  )  er  of  Ox  and  C'i(y)  within  (2N  +  1  )  er 
ofOy. 

We  state  first 

LEMMA  4.  Suppose  that  P,  Q  are  connected  by  two  arcs  PR±  Q,  PR%  Q  lying 
respectively  within  a  distance  d  of  two  perpendicular  planes,  say  x  —  0,  y  =  0, 
and  within  r-f-er  of  O.  Then  P,  Q  may  be  connected  by  an  arc  lying  within 
8+  3er  of  each  plane,  if  i/f(4r)  <  |e  <  J. 

It  is  clearly  sufficient  to  consider  the  case  when  the  given  arcs  are  simple 
and  do  not  meet  except  at  their  end-points.  Projecting  on  to  the  plane  z  =  0, 
we  see  that  all  points  of  S  within  the  simple  closed  curve  C  —  PRl  QR2  P 
satisfy  |  x  \  ^  S+  er  or  \y\  ^8+er.  As  in  lemma  1,  they  also  satisfy 


Now  take  for  V  the  set  of  points  within  2r  of  O  satisfying  |  x  \  <  d  +  2er. 
Then  $(P;  V,  er)  will  certainly  include  all  points  of  PRl  Q  but  not  in  general 
all  points  of  PR%  Q.  It  will  therefore  cut  off,  from  the  domain  inside  C,  a 
component  bounded  by  the  arc  C:  ~  PR1Q  and  an  arc  C2  consisting  of 
portions  of  PR2Q  and  portions  of  B(P;  V,er)  lying  inside  C.*  Now  all 
points  inside  or  on  B(P\  V,  er)  satisfy  |  x  \  ^5+3er,  and  all  points  inside 
or  on  G  satisfy  min  (  |  x  \  ,  \y\)^8  +  er.  Further,  all  points  of  B  satisfy  either 
*J(x2  +  y2  +  z2)  >2r  —  er  or  |  x  \  >#-f  er;  for  those  points  inside  C  the  former 
is  impossible  and  the  latter  implies  \y\^d+er.  It  follows  that  C2  satisfies 
our  requirements.  We  note  further  that  if  PRL  Q  and  PR2  Q  lie  in  the  region 
a^z^b,  then  all  points  inside  or  on  C,  and  so  all  points  of  <72,  satisfy 
a  —  er  ^  z  ^  b  +  er. 

We  now  turn  to  the  main  proof  of  theorem  II.  We  may  suppose  e<  J. 
It  is  clear  that  the  conditions  of  theorem  I,  with  r  replaced  by  2r  and  e  by 

*  In  general  there  might  be  an  infinite  number  of  such  portions,  and  also  a  closed 
set,  formed  by  their  limit  points,  common  to  (7  and  B(P;  V,  er).  If,  however,  our 
curves  are  all  made  up  of  a  finite  number  of  stretches  given  by  u  —  constant  or 
v  =  constant,  there  can  be  only  a  finite  number  of  such  portions  to  consider. 
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e'  =  10~3e2,  cannot  be  satisfied.  Hence  any  point  P  of  S(O ;  r,  er)  with  OP  <  r 
can  be  connected,  within  6e'r  of  any  line  L  through  P,  with  some  point 
distant  2r  ±  2e'r  from  O,  and  so  certainly  with  some  point  of  B(O\  r,  er).  In 
particular,  we  can  connect  O  itself  with  points  of  B  by  two  arcs  C0(x),  C0(y) 
lying  within  6eV  of  Ox,  Oy  respectively.  If  we  replace  6eV  by  lOe'r,  we  may 
further  suppose  that  these  arcs  do  not  meet  except  at  O.  For  let  Q  be  the 
last  point  on  C0(x)  where  it  meets  C0(y).  By  lemma  4,  there  is  an  arc  K  =  OQ 
of  which  all  points  satisfy 

max  ( |  x  | ,  \y\,  \  z  \ )  ^  6eV  +  6e1r. 

We  replace  the  arcs  OQ  of  C0(x),  CQ(y)  respectively  by  the  arc  K.  If  this 
introduces  closed  loops  (for  example,  if  CQ(y)  meets  K  again  beyond  Q)  we 
eliminate  these  and  are  left  with  two  arcs  having  a  certain  part  K^  in 
common  and  not  meeting  again.  We  finally  replace  Kx  by  two  separate  arcs 
very  nearly  coinciding  with  it,  and  thus  obtain  two  separate  curves  (which 
we  shall  still  call  C0(a?),  C0(y))  satisfying  the  required  inequalities. 

Let  us  suppose  for  definiteness  that  the  end-point  of  C0(y)  has  y  positive. 
Then  let  Px  be  the  first  point  of  CQ(y)  with  y  =  er.  We  may  connect  Px  with 
a  point  of  B  by  a  curve  (\(x)  lying  within  6eV  of  the  line  y  =  er,  z  =  z(P1), 
and  so  within  16eV  of  y  —  er,  z  =  0.  As  above,  we  may  suppose  that  C-^(x) 
does  not  meet  C0(y)  again,  if  we  modify  CQ(y)  as  may  be  necessary,  and 
replace  lOeV  for  C0(y)  and  16eV  for  G\(#)  by  24eV  in  each  case.*  Since 
24e'  <  |e,  the  modification  cannot  cause  C0(y)  to  cut  CQ(x),  neither  can  C0(x) 
meet  C±(x).  Again,  the  part  of  C^(y)  for  which  y  >  |e  is  still  within  lOeV  of  Oy. 

Now  let  P2  be  the  first  point  of  the  modified  CQ(y)  satisfying  y  =  2er. 
We  construct  similarly  an  arc  from  P2  to  B,  lying  within  24eV  of  y  =  2er, 
z  =  0,  which  we  may  again  suppose  not  to  meet  C0(y)  except  at  P2.  We 
proceed  similarly  up  to  P2N  obtaining  arcs  C^x),  C2(x),  ...,C2N(x). 

On  each  curve  C^x)  we  take  the  first  point  Qi  such  that  \x\  —  ier  and 
connect  it  with  a  point  of  B  by  a  curve  Cf(y)  lying  within  6eV  of  x  =  x(Qi), 
z  =  z(Q^.  We  may,  as  before,  suppose  that  C^y)  does  not  meet  Ct(x)  again, 
if  we  modify  C^x)  and  replace  the  24eV  for  C^x)  and  the  6eV  for  C^y)  by 
36eV.  As  before,  this  cannot  introduce  intersections  with  curves  already 
constructed,  and  no  two  C.t(y)  can  meet.  Then  C^y)  lies  within  60eV  of  the 
line  x  —  ±  ier,  z  =  0. 

We  next  prove  that  no  two  curves  C^x),  Cj(y)  (i^=j,  j=}=0)  can  meet. 
Suppose,  if  possible,  that  they  do;  let  R,  say,  be  the  first  point  on  C^x) 
where  it  meets  C^y).  Then  PiPjQjRPi  is  a  simple  closed  curve  C  whose  four 


*  Since  16e'-f  6ex<  lie'  and  17V2<24.  It  may  be  remarked  that  the  case  is  a 
little  more  complicated  than  the  preceding  one,  as  C^(x)  may  meet  CQ(y)  in  points  on 
either  side  of  Pj. 
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portions  lie  within  60eV  of  the  straight  lines  x  =  03  y  =  jer,  x  =  ± jer, 
y  =  ier,  all  in  the  plane  z  =  0.  Just  as  in  the  last  part  of  §  2,  it  now  follows 
that  any  point  of  S  inside  C  is  within  61eV  of  the  plane  2  =  0;  while  if  P  is 
any  point  of  8(0;  r,  er)  outside  <7,  and  a  the  angle  between  the  plane  z  =  0 
and  the  line  joining  P  to  the  midpoint  of  the  rectangle  formed  by  the  above 
straight  lines,  then  (|e  —  60e')  r  sin  a  <  62e  V.  Thus  every  point  of  S(O ;  r,  er)  lies 
within  Sr.  I22e'/e  of  z  =  0.  Since  e'  =  10~3e2this  contradicts  our  assumptions. 
Now  the  curves  C^x),  together  with  the  part  of  C0(y)  from  0  to  P^, 
divide  S(O;  r,er)  into  (2^-f  1)  sectors.  From  what  has  just  been  proved, 
each  curve  C^ (y)  (j  3=  0)  lies  entirely  in  one  sector.  No  sector  can  contain 
more  than  two  Cj(y)y  namely,  those  originating  from  the  two  C^x)  which 
bound  the  sector.  Hence  at  least  N  sectors  must  contain  a  curve  C$(y). 
We  pick  one  curve  C^(y)  from  each  of  N  such  sectors,  and  renumber  them 
in  the  order  in  which  they  are  met  in  traversing  B(O\r,  er),  as  C[(y),  C'2(y), 
•••>  @N(y}-  Between  each  two  curves  C'^y),  C'j+l(y)  lies  at  least  one  curve 
CL(x),  forming  the  boundary  of  the  sector  containing  Gj(y).  We  pick  out, 
for  each  j,  one  such  x  curve  and  call  it  C'i+1(x).  Similarly,  we  have  C'±(x) 
between  C#(y)  and  C[(y)* 


*  For  the  sake  of  clearness  we  give  a  diagram  illustrating  a  possible  topological 
arrangement  on  S  of  the  curves  considered,  taking  N  =  2. 

Here,  for  example,  C((y)  could  be  either  G^y]  or  C4(2/),  and  C'z(y]  could  be  C$(y) 
or  Cz(y).  If  we  take  C((y)  =  C^y)  and  C'^y)  =  Cs(y),  then  C'2(x)  may  be  either  C^x) 
or  C3(x),  and  Cfa)  either  C2(x),  C0(x)  or  C^x). 


C2(x) 


PI 


p* 


p* 


B  =  B(0;r,er) 
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Starting  from  0,  we  now  clearly  have  2N  curves  which  satisfy  the  con- 
ditions laid  down,  except  that  they  coincide  for  a  certain  distance  from  0. 
We  can  avoid  this  by  replacing  any  portion  of  arc,  forming  part  of,  say, 
p  curves,  by  p  nearly  coincident  arcs.  We  thus  have  the  required  curves 


4.  We  now  show  that,  for  sufficiently  small  r,  the  conditions  envisaged 
in  theorem  II  cannot  in  fact  arise.  That  is,  we  can  prove: 

THEOREM  III.  Given  any  e  >  0,  then,  for  all  sufficiently  small  r,  S(O;  r,  er) 
lies  within  er  of  some  plane  through  0. 

COROLLARY.  Given  e  >  0,  then,  for  any  sufficiently  small  r,  any  point  of  8 
which  lies  within  r  of  O  is  within  er  of  some  fixed  plane  through  O. 

Let  us  assume  the  theorem  false.  We  may  clearly  suppose  e<  0-0001. 
Let  r0  be  such  that  ^r(32r0)  <ex  =  10~3e2.  Consider  B(O\  r0,  er0);  this  curve 
can  contain  at  most  a  finite  number,  say  N,  of  non-overlapping  arcs  each  of 
diameter  not  less  than  ^r0.  We  take  now  e2  such  that  (2N~\-  I)e2<e1,  and 
write  e3  =  10~9ef.  By  our  assumption,  there  exists  r±  so  small  that  fr(4ri)  <  e3 
while  $(0;  F^erj),  and  a  fortiori  S(O;  T^t€^r^*  is  not  entirely  within  6zr1 
of  any  plane  through  0.  We  take  any  orthogonal  axes  through  0. 

Letnowrn  =  4^h-ltSn  =  S(O;rn,ern),3,ndBn  =  B(O;rH,ern)(n  =  1,2,...). 
We  now  say  that,  so  long  as  i]s(8rn)  <  e^  Sn  contains 

(a)  a  point  for  which  \z\>  Jrn,  \x  ,  \y\<  0-lrw; 

(b)  2N  non-intersecting  arcs  Cf(x),  Cf(y)  leading  from  0  to  Bn  [arranged 
so  that  in  traversing  Bn  they  are  met  in  the  order  C%(x),  C^(y),  C$(x),  ..., 
Cjv(y)],  such  that  all  points  on  Cf(x)  satisfy 


\y\<  16er7l,      |  z  | 
while  all  points  on  Cf(y)  satisfy 


In  particular,  it  will  follow  that  the  end-points  X$,   Y?  of  C?(x),  C$(y) 
satisfy  |  x  \  >  0-8rM,  |  z  |  <  0-2r/t  and  |  y  |  >  0-8r,ft,  |  z  \  <  0-4rw  respectively. 

The  above  statement  is  true  for  n  =  1  ;  part  (a)  follows  from  theorem  I 
(according  to  which  it  is  possible  to  connect  0  with  a  point  distant  rL  ±  er± 
from  0  by  a  curve  within  3erx  of  Oz)  and  part  (6)  from  theorem  II.  Assume 
that  the  statement  is  true  for  n  =  k.  Then  Sk,  and  so  Sk+1,  is  not  entirely 
within  erk+1  of  any  plane  through  0.  Just  as  above,  this  proves  part  (a) 
for  rk+l  by  theorem  I. 


*  We  may  clearly  suppose  that  S(O;r,  S{)  includes  S(O;r,  #a)  if  dl  <  8Z.  This  remark 
will  be  used  again  without  explicit  reference. 
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Consider  now  any  end-point  X  $  =  (#0,  yQ,  z0)  say.  Suppose  first  that 
z0  >  0.  Let  L  be  the  line  joining  X\  to  the  point  (0,  y0,  0-lrk).  By  theorem  I, 
there  exists  an  arc  in  jS,  lying  within  3erk+2  of  L  produced,  which  joins  X$ 
to  a  point  distant  rk+2(I  ±  e)  from  O.  The  part  of  this  arc  in  Sk+l  joins  X$  to 
a  point  of  Bk+l1  which  we  take  for  X%+1.  Let  C%+l(x)  be  made  up  of  C\  (x) 
together  with  this  arc  X^X^1,  cutting  out  any  closed  portions  so  as  to 
obtain  a  simple  arc.  We  see  at  once  that,  throughout  Oj+1(#), 


Now  since  |  x0  |  ^  0-8rfc  and  0  <  z0  <  0-2rfc,  the  z/#  slope  of  L  is  numerically 
less  than  0-15.  Hence,  at  all  points  of  X$  X^+1}  and  therefore  throughout 
C$+l(x)t  we  have 

|  z  |  <  0-lrfc  +  0-15  (|  #  | 


The  conditions  laid  down  are  therefore  satisfied  for  O^+1(x).  If  zQ<0  we 
take  L  as  the  line  joining  X*  to  (O\yQ,  —  Olrfc).  For  (7*+1(y)  we  work 
similarly,  interchanging  a;  and  y,  but  replacing  Q-lrk  by  O^r^.  We  find  again 
that  the  required  conditions  are  satisfied. 

We  now  have  to  prove  that  no  two  of  our  curves  meet.  For  topological 
reasons,  since  C$(x)  lies  in  a  sector  of  Sk  bounded  by  C*_i(y),  C]l(y),  and  J3k, 
it  is  sufficient  to  prove  that  C$+1(x)  cannot  meet  C]t+l(y)  or  C^l(y).  Con- 
sider, for  example,  C$+l(x)  and  C^(y).  We  know  that  C%(x),  C%(y)  do  not 
meet.  All  points  of  C*+l(y)  satisfy  \x\<  16erfc+1.  Any  point  of  the  arc 
XiX$+1  satisfying  |  x  |  <  I6erk+l  also  satisfies  the  inequalities  \y\<  l^erfc+1 

1  1  z  |  -  0-lrfc  |  <  0-15  (|  x  |  +  3erfc+2)  -f  3erfc+2 
<  0-03rfc 

(since  e<  0-0001).  Similarly,  any  point  of  Fj  FJ+1  for  which  |  y  |  <  16er&+1 

must  satisfy 

11*1-0-^  |  <0-04rfc. 

It  is  thus  clear  that  these  two  arcs  cannot  meet.  Again,  any  point  of  Cf\y) 
for  which  \y\<  16erk+l  will  satisfy 


so  that  X$  X%+1  cannot  meet  C?(y).  Similarly,  Ff  Ff+1  cannot  meet 
It  follows  that  no  two  of  the  curves  CJ+1  can  meet. 
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By  induction,  our  statement  is  therefore  true  for  all  n  such  that  ^(8rn)  <  e1? 
and  in  particular  for  n  such  that  rn_1  <  r0  ^  rn.  Now  the  paths  Cf(x)9  Cf(y) 
meet  B(O;  r0,  erQ)  in  2N  points  Pi}  Q^  say  arranged  in  the  order  Pl  Qj^P2  Q2  . . . . 
For  Pt,  we  have  \y\<  lGern  <  64er0  and  \z\<  0-04r,t  +  0-15  \x\<  0-35r0,  and 
therefore  |  x  \  >  0-7r0,  since  OPt  >  r0(l  —  e).  Similarly  Qi  satisfies  the  relation 
|  y  |  >  0-7r0.  Thus  each  arc  P^  Qt  or  Q^P^x  is  of  diameter  greater  than  Jr0,  and 
so  we  have  a  contradiction.  Thus,  the  main  theorem  cannot  in  fact  be  false. 

To  prove  the  corollary , .we  remark  that  if  S(O\  rt  er)  lies  entirely  within 
er  of,  say,  the  plane  z  —  0  (taking  axes  through  0),  then  B(O\  r,  er)  must  lie 
within  2er,  at  most,  of  the  circle  xz  +  y2  —  r2,  z  =  0.  Projecting  on  to  the 
plane  z  —  0,  we  see  that  any  point  of  S  within  |r  of  O  must  lie  inside  B 
(provided  that  i/r(r)  <  |  —  2e)  and  therefore  within  er  of  z  =  0.  We  obtain 
the  corollary  as  stated  by  writing  2r  and  \e  for  r  and  e. 

5.   THEOREM  IV.  There  exists  a  tangent  plane  at  O. 

Let  H(r,  e)  denote  any  plane  through  O  such  that  all  points  of  S  .K(O,r) 
lie  within  er  of  H(r,  e).  Given  e,  which  we  may  suppose  less  than  0-001,  we 
can  find  8  such  that,  for  all  r  <  d,  H(r,  e)  exists  and  also  ^r(2r)  <  e.  We  remark 
first  that,  arguing  as  in  the  proof  of  theorem  I,  we  can  show  that  any  point 
of  H  within,  say,  0-8r  of  O  must  be  the  orthogonal  projection  of  some  point  of 
S .  K(0,  r).  Again,  if  P,  Q  are  any  two  such  points,  and  P,  O  their  projections 
on  to  H,  we  may  take  as  the  volume  V  a  cylinder,  with  generators  perpendi- 
cular to  H,  surrounding  the  segment  PQ  at  a  distance  2er.  It  is  easily  seen 
that  P  and  Q  must  both  lie  inside  B(P;  V,  er)  and  can  therefore  be  connected 
by  an  arc  lying  in  $(P;  V,  er).  Such  an  arc  lies  in  V  or  within  er  of  V,  and  also 
within  er  of  H.  It  is  therefore  entirely  within  4er  of  the  segment  PQ  and 
5er  of  PQ. 

We  now  prove  that  if  0  <  rt  <  r0  ^  d,  then  H(r0,  e)  and  H(rl5  e)  are  inclined 
at  an  angle  of  at  most  2a,  where  sin  a  =  40e.  This  is  clearly  true  if  r0  ^  2rlt 
for  there  exists  a  point  of  S  distant  at  least  £rx  from  O  and  at  most  er0  from 
either  plane.  Suppose  that  the  statement  is  true  whenever  r0  <  2*rj_,  and 
consider  the  case  2krl  <  r0  <  2fc+1r1.  Take  axes  through  O  so  that  H^,  e) 
is  the  plane  z  —  —  x  tan  a  and  H(r0,  e)  is  z  —  x  tan  6,  where  —  a  <  6  <  \n  —  a. 
Using  H( 2*^,6)  as  an  intermediary  we  easily  see  that  #<3a;  we  wish  to 
prove  6<oc.  We  write  ri  =  2*-%  (i^l)  and  H^  =  H(r^e).  Let  Pi3  Qi  be 
points  of  $.K(<9,  rj  whose  projections  P<,  Q*  on  H^  satisfy  x  =  y  =  \ri  and 
x  =  —  y  —  Jr^  respectively  (i  —  0, 1, ...,  &-f  1),  and  let  R  be  a  point  of 
$.K(O, rx)  whose  projection  on  tL1  satisfies  x  =  Jrls  y  =  0.  We  can  connect 
O,  Pj  and  O,  (^  by  arcs  lying  within  4er1  of  OPlt  OQi  respectively.  We 
connect  Pi  with  Pi+l  and  Qt  with  Qi+1  by  arcs  lying  within  5eri+1  of  J^JFJ+1, 
GiGi+i  respectively  and  so  within  6eri+1  of  P^P^^i  and  OiOi+i-  Finally, 

BB2 
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we  connect  PK+I  with  P0,  P0  with  Q0,  and  Q0  with  $j.+1  by  arcs  within  6er0 
of  Pfc+iP0>  PO  QO  and  Qo  Qfc+i-  We  thus  obtain  a  closed  curve 


which  contains  a  simple  closed  curve  C  passing  through  or  near  the  same 
points. 

We  now  project  on  to  the  plane  z  =  0.  The  coordinates  of  R  satisfy 


The  projections  of  the  arcs  0-Pl5  P^P^t  etc.,  lie  within  6erls  6er2,  .  .  .  ,  of  straight 
segments  which  together  make  up  the  triangle  of  vertices  (0,  0,  0),  (^?*0,  ^r0,  0) 
and  (  Jr0,  —  Jr0,  0).  It  is  easily  seen  that  any  ep-deformation  of  the  projection 
of  C  must  surround  the  projection  of  R,  so  that  R  is  inside  C  on  S.  On  the 
other  hand,  if  we  project  on  to  the  plane  x  —  0,  the  segments  0P15  OQi 
project  on  to  the  segments 

z  —  —y  tan  a,  0<y<-|r1     and     2  =  «/tana,  Oy^—Jrj. 

The  projections  of  the  segments  PtP2)  ...,PkP0  and  QiQ2>  .-•>  Q/4:Qo  all 
satisfy  j  */  |  ^  lrlt  and  the  projection  of  P0Qo  satisfies  z  =  Jr0tan#.  It  is 
clear  that  if  0  >  0  the  projection  of  OPj^Pg  .  .  .  P0  QQ  •  •  •  Qi  O  does  not  surround 
the  projection  of  R,  and  on  calculating  the  distances  it  is  found  that,  since 
sin  a  =  40e,  if  6  ^  a  no  ep  -deformation  of  G  can  surround  this  point.  Thus 
6  <  a,  as  was  to  be  proved. 

If  now  en->0  as  n->ao  in  such  a  way  that  H(2~n,  ew)  exists  for  each  ft, 
it  follows  at  once  that  H(2~n,  en)  tends  to  a  limiting  plane  H,  which  is  clearly 
a  tangent  plane  to  the  surface. 

6.    We  can  now  prove  the  final  result: 

There  exists  a  parametrization  differentiable  at  O. 

Let  z  —  0  be  the  tangent  plane.  By  previous  results  we  know  that  every 
point  of  z  =  0  within  a  certain  distance,  say  d,  of  0  is  the  projection  of  some 
point  of  S.  Also,  there  exists  a  function  <f>(r),  tending  to  zero  as  r->0,  such 
that  any  two  points  P,  Q  of  the  surface,  whose  projections  P,  O  on  z  —  0  lie 
within  2r  of  O,  can  be  joined  by  an  arc  lying  within  r(p(r)  of  PQ-  We  may 
suppose  cj)(r)  ^  ijr(2r). 

We  now  construct  a  sequence  of  regular  polygons  Cn  in  the  plane  z  =  0, 
inscribed  in  circles  of  centre  O  and  radius  rn,  n  —  1,  2,  ....  We  prescribe  first 
that  rn+1/rn  lies  between  f  and  1,  and  increases  with  n,  while 

'n-'nl^fo-l-'n)       for 


We  can  then  choose  the  number,  kn,  of  sides  of  Cw  so  that  kn  is  a  power  of  2 
and  the  length  of  each  side  of  CM  lies  between  i(fw_i~  rn)  and  2(rn  —  rn+1), 
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,or  for  n  =  1,  between  J^  — r2)  and  2(r1  —  ra).  We  may  suppose  that,  under 
these  conditions,  kn  increases  with  n  and  &x  ^  16.  C,t  and  Cn+1  will  not  meet, 
and  the  annulus  between  them  can  be  divided  into  triangles  whose  vertices 
are  vertices  of  the  polygons  and  whose  sides  are  of  length  at  most  3(rn  —  rn+1). 
There  exist  positive  constants  A,  B  such  that,  however  rn  and  kn  are  chosen 
subject  to  the  above  restrictions,  every  side  PQ  of  any  triangle,  with  one 
end-point  in  Cn,  is  distant  at  least  A  (rn  —  rn+1)  from  any  other  side  RT  which 
it  does  not  meet,  and  every  angle  6  of  every  triangle  satisfies  cosec  ^0  <  B. 

Since  <j>(r)  ->  0  as  r->  0,  we  can  now  finally  choose  the  rn,  in  succession,  so 
that  A  (rn  -  rn+1)  >  4=(B  +  1 )  rn$(rn),  r±  ^  3,  and  rn  -»  0  and  rn+1/rn  ->  1 ,  kn  ->  GO, 
as  n-»oo.  We  thus  get  an  infinite  triangulation  of  a  certain  neighbourhood 
of  O  in  the  plane  2  =  0. 

We  now  construct  a  corresponding  division  of  S.  We  associate  with  each 
vertex  P  of  each  Cn  a  point  P  of  S  of  which  P  is  the  projection.  Consider 
now  the  various  lines  PQ,  PR,  ...,  drawn  from  P  in  the  triangulation.  Let 
P  be  in  Cm,  O  in  Cn  (n  =  ra  —  1,  m,  or  m+  1)  and  j  —  max (m, n).  On  PQ 
insert  points  Qlf  Q2  so  that  PQi  =  2Brm$(rm),  Q2Q  =  2Brn$(rn),  and  let 
Qls  Q2  be  corresponding  points  of  S.  (It  is  clear  that  PQj  and  Q2Q  do  not 
overlap.)  We  connect  P  with  Qit  Q±  with  <?2  and  Q2  with  Q  by  arcs  lying 
within  rj(p(rj)  of  PQj,  QiQ2  and  Q2Q  respectively  (this  is  possible  since 
P  and  Q  lie  within  r^_x  of  O  and  rj_l  <  2r^),  and  cut  out  any  closed  portions 
so  as  to  get  a  simple  arc  PQ.  We  suppose  a  similar  construction  carried  out 
for  all  other  straight  lines  of  the  triangulation.  In  particular,  let  PR^R^B 
be  a  similarly  constructed  arc  corresponding  to  PR.  Then,  since  the  distance 
between  the  segments  PR  and  Qx  Q  is  at  least  2Brm(/>(rm)  sin  ^QPR,  whether 
QPR  is  acute  or  obtuse,  the  arc  PR  cannot  meet  the  arc  PQ  except  possibly 
in  a  point  belonging  to  the  part  PQ1  and  similarly  to  PRl.  If  PQ±  and  PR± 
do  meet,  let  T  say  be  the  last  point  of  intersection  on  PB^.  We  may  replace 
the  part  PT  of  PB±  by  the  corresponding  part  of  PQV  We  may  work 
similarly  with  all  the  arcs  leaving  P,  so  that  any  two  may  have  an  initial 
part  in  common  but  do  not  meet  again.  By  replacing  the  common  parts  by 
two  or  more  nearly  coincident  arcs,  we  may  finally  suppose  that  none  of  the 
arcs  PQ,  PR,  ...,  meet  except  at  P.  Since  only  the  parts  PQit  PR^  •>•  have 
been  altered,  the  modified  arcs  still  lie  within  (2B  -f  1 )  rm(j>(rm)  of  PQ,  PR, .... 
We  suppose  a  similar  modification  carried  out  for  each  vertex  of  the  tri- 
angulation. It  is  then  easily  seen  that,  since  4(1?  +  1 )  rm(f>(rm)  <A(rm  —  rm+1), 
two  arcs  corresponding  to  non-intersecting  segments  do  not  meet.  Thus  no 
two  arcs  meet  except  at  a  common  vertex. 

The  arcs  corresponding  to  the  sides  of  Cn  form,  taken  together,  a  simple 
closed  curve  Cn.  Projecting  on  to  z  =  0  we  obtain  a  closed  curve  Cn  lying 
within  (2B+  l)rn<j>(rn)  of  Cn.  Thus  Gn  surrounds  any  vertex  P  of  Cn^  and 
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is  distant  at  least  2(B  +  l)rn(j)(rn)y  a  fortiori  at  least  2rni/r(2rn),  from  P.  It 
follows  that  the  corresponding  point  P  on  8  is  inside  Cn,  and  so  Gn+l  lies 
entirely  inside  Cn  on  8.  The  region  between  Cn  and  Cn+1  on  S  is  therefore 
divided  into  domains  in  one-one  correspondence  with  the  triangles  lying  in 
the  annulus  between  Cn  and  Cn+1. 

We  now  set  up  an  arbitrary  topological  correspondence  between  each  arc 
PQ  and  the  corresponding  segment  PQ.  For  each  triangular  area  PQR  say, 
there  then  exists  a  topological  correspondence  with  the  domain  PQR  on  S, 
agreeing  with  the  correspondence  already  set  up  on  the  bounding  arcs. 
On  combining  all  these,  and  making  O  correspond  to  itself,  we  have  a  one-one 
continuous  correspondence  between  the  interior  of  C±  in  the  plane  z  =  0  and 
the  interior  of  6\  in  $,  for  any  point  of  8  except  O  lies  outside  some  Cn,  and 
therefore  falls  into  some  domain  PQR  if  inside  6\.  Outside  Cj  and  Ct  the 
mapping  can  be  completed  in  any  way. 

Let  (x,  y,  0)  be  any  point  of  z  =  0,  lying  in  some  triangle  PQR  between 
Cn  and  Cn+1  say,  and  (X,  Y,  Z)  the  corresponding  point  of  S.  Since  all 
points  of  the  arcs  PQ,  QR}  HP  are  within  2rn  of  O,  the  projection  (X,  V,  0) 
must  lie  either  inside  the  projection  of  the  curve  PQRP,  or  at  a  distance  at 
most  2rw^(4rn)  from  it;  that  is,  either  inside  the  triangle  PQR  or  distant 
at  most  2rnft(4rn)  +  (2B+l)rn<fi(rn)  from  it.  The  distance  of  (#,y,0)  from 
the  sides  of  PQR  is  less  than  rn  —  rn+1  and  each  side  of  PQR  is  of  length  at 
most  3(rn  —  rn+1).  Since  <j)(rn),  ft(&rn)  and  1  —rn+ilrn  a^  tend  to  0  as 
we  see  that  x  =  x  +  o  <](x*  +  2/2),  Y  =  y  +  o  J(x*  +  y*) 


as  x,  «/->0;  hence  X,  Y  are  differentiable  functions  of  (x,  y)  at  x  =  y  —  0 
and  satisfy  the  equations 

ax_ay_1    ^_SF__O 

dx        dy         }      dy        dx 
Finally,  since  z  =  0  is  a  tangent-plane,  we  have 


so  that  Z  is  differentiable  and 

dZ      dZ 

~        —  o     at    x  =  y  —  0. 

ox       dy 

Thus  we  have  set  up  a  parametrization  which  is  differentiable  at  0. 
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1.    On  the  basis  of  the  hitherto  unproved  "extended  Riemann  hypo- 
thesis", Littlewood  (  1  )  proved  that  there  are  infinitely  many  k  such  that 


i     n         log  log  k  6  v 

where  x(n)  denotes  a  real  primitive  character  (mod  A;),  and  C  is  Euler's 
constant. 

Independently  of  each  other,  and  almost  simultaneously,  Linnik(2), 
Walfisz(3)  and  1(4)  proved  the  following  results  without  assuming  any 
hypothesis  : 

(I)  There  are  infinitely  many  k  such  that 

«  x(n)      _  A 
" 


where  A  is  a  certain  absolute  positive  constant,  and  x(n)  i8  a  real  primitive 
character  (mod  A;). 

(II)  There  are  infinitely  many  k  such  that 


where  e  is  an  arbitrary  positive  number,  and  x(ri)  is  a  real  primitive  character 
(mod  A;). 

Of  these  results  (II)  was  proved  by  me;  the  sharper  result  (I)  is  due  to 
Linnik  and  Walfisz.  I  now  find  that  a  simple  sharpening  of  my  method  used 
to  prove  (II)  will  prove  Littlewood's  result  without  assuming  "the  extended 
Riemann  hypothesis".  In  fact,  all  we  have  to  do  is  to  replace  the  number 
g  of  my  paper  (4)  by  ,-  loga. 


where  [t]  denotes  the  greatest  integer  contained  in  t. 
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As  my  paper  (4)  contains  misprints  (nor  is  it  easily  available)  I  develop 
the  whole  argument  without  any  reference  to  this  paper.  We  actually  prove 
somewhat  more  than  Littlewood's  conjecture,  namely,  theorems  1  and  2 
of  §  1  1  (towards  the  end  of  this  paper). 

2.   Definitions.  pm  denotes  the  mth  odd  prime, 


where  g  is  defined  by  (3)  below;  6  is  a  positive  integer  such  that  (b/pr)  =  —  1 
for  l<r<g>  &E=5  (mod  8)  and  !<6<8a;  x  is  a  sufficiently  large  positive 

intoger'  ,    r   log*   I.  (3) 

9-l(l3p^?J'  (3) 

(njm)  is  the  Jacobi  symbol  if  m  is  odd  and  prime  to  n,  but  is  0  in  all  other 
cases  (i.e.  when  m  is  even  or  when  m  and  n  have  a  common  factor).  We  write 

(4) 


and  *<*)  =      S        S-  (5) 

3r<n£2xm~lm\      w       / 

We  observe  (for  further  reference)  the  fact  that  we  may  write 

=  v  ^L) 
T     m    ' 


where  ^(m)  is  a  real  primitive  character  (mod  A;)  if  San~\-b  is  quadraifrei 
and  A;  =  8a?^  +  6.  We  also  note  that 


\m 
provided  m  is  odd,  which  is  the  reciprocity  law  for  Jacobi's  symbol. 

3.   We  first  prove  that     T(x)  =  S(x)  -f  O(x*).  (6) 

The  proof  of  (6)  needs 

LEMMA  1.  If  x(n)  *5  a  non-principal  character  (mod  A;),  then 


This  is  a  well-known  result  when  x(n)  IQ  a  primitive  character  (modfc); 
the  extension  (due  to  Davenport)  to  non-principal  characters  x(n)  is 
easily  made. 
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LEMMA  2.    We  have  a  <  x&>     (x  >  x0). 

Proof.  For  x  >  XQ,  log  a  <  &(pg)  <  2pg  <  30  log  0  <  3  log  a/log  log  x  <  ^  log  x- 

Now,  using  lemma  1, 

*•(.)-«(.)-     S      S 


by  lemma  2  and  the  fact  that  I  —  ---  I  is  a  character  (mod  2($an  +  b))  (since 

\     m     / 

the  symbol  is  0,  by  definition,  whenever  m  is  even).  Thus  (6)  is  proved. 
4.    We  also  need  the  following  two  lemmas: 
LEMMA  3.    The  number  of  quadratfrei  integers   San  +  b   (x<n^2x)  is 

(1  +  0(1)}*. 

Proof.  The  number  San  +  b  cannot  be  divisible  by  p*  when  l<r£g. 
Now  the  number  of  numbers  San  +  b  (x<n^  2x)  which  are  divisible  by  p; 
(r  >  g)  is  clearly  of  the  order 


-  0(X). 

Hence  we  obtain  lemma  3. 

LEMMA  4.  Let  J?(y)  denote  the  number  of  positive  integers  m  such  that 
(i)  m£y, 

(ii)  m=sp?p?...p%'     (a's^O). 

Then  F(y)<tyy>     when    y>xl, 

where  g  is  as  defined  above. 

Proof.  The  number  of  positive  integers  of  the  form  pl  (p  fixed  ^  2  ; 
t  —  0,1,2,3,...)  and  not  exceeding  7/is  clearly  less  than  2  (log  y/logp)  <  3  log  y, 
whenever  p^pg  and  x  >  x0.  Hence,  for  x  >  xQt 

Iogj%)<01og(31og2/). 

N°W  9  =  (log  log*?  "  °  ((loglogyJV 

since  x  <  y*  ;  hence          log  F(y)  =  O(log  t//log  log  y),  (7) 

and  lemma  4  follows  at  once. 
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5.    It  is  clear  from  (5)  that 


(8) 
where,  for  r  —  1,  2,  3, 

(9) 


and  m  runs  through  different  sets  of  values  (described  below)  in  the  3  sums: 
(i)  In  S^x),  m  takes  all  values  (  ^x*)  of  the  form  p*lp%*  <-P%«  (a's>  0), 
i.e.  m  is  not  divisible  by  any  prime  greater  than  pg. 

(ii)  In  S2(x)t  w  takes  only  values  of  the  form  m  =  rajWg,  where 
mi  =  Pi  'Pz9  •  •  •  Pg'  (a's^O),  while  m2  =  Q2M,  where  (m2,a)  =  1,  and  Jf  is 
quadratfrei  and  greater  than  1  (so  that  m  and  ma  cannot  be  perfect  squares). 

(iii)  In  $3(#),  m  takes  only  values  of  the  form  m^  Q2,  where 


and     (Q,a)  =  1,  6>1. 

It  is  clear  that  these  three  types  of  m  are  non-overlapping  and  exhaust 
all  positive  integers  m,  and  so  (8)  is  rendered  obvious. 

6.     In  S^x)  we  clearly  have 


\ 


(b\ 
=    —    = 

\m/ 


where  A(m)  is  Liouville's  function  (Landau,  Handbuch  der  Primzahlen,  2, 
(1909),  617)  defined  as  follows: 


A(l)  =  1,     A(w)  =  (  -  1 
where  w.  =  g{»g|'...  gf1,  and  the  #'s  are  distinct  primes  (/?'s  >  0).  Hence 


«!(*)  =  *    S,  (10) 

m^x*     W 

where  m  runs  only  through  positive  integers  of  the  form  pflp2n  -  •  •  Pg"  (each 
a^O). 

7.   In  S2(x)  we  have  m  =  m^m^  and  so 

/8aw.  +  6\  _  /8an-h6\  /8 
V     w     j  "  \MI     )  \ 


w2    / 

since  rax  is  not  divisible  by  any  prime  greater  than  pg.  Since  mz  is  not  a 
perfect  square,  we  have 


1946]        IMPROVEMENT  OF  A  THEOREM  OF  LINNIK  AND  WALFISZ  427 

where  n  runs  through  a  complete  set  of  residues  (mod  raa)  ;  and  hence 

-  o(m), 


=  o(m). 


Hence 


=  O/  £    -)  =  O(x*). 

\mgic*  Wl-/ 


(11) 


8.    In  83(x),  m  runs  through  numbers  of  the  type  ml  Q2  where  m±  is  of  the 
form  pf  lp%*  .  .  .  Awhile  (Q,a)  =  1,  Q>  1.  Hence 


s    s  - 


^    s 


and  so 

9.    From  (6),  (8),  (10),  (11),  (12),  we  get 

s     s!(? 

a;<nS2a;m=l^  \ 


10.   We  now  proceed  to  consider  the  sum 

v  A(w) 

/  j      —  —  —  ^ 

w^«s    m 
which  occurs  in  (13)  above.  We  have 

y    A(w)  =  yA(m)         y 


(14) 
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where,  in  a,(x),  ra  runs  over  all  positive  integers  of  the  form  pflp%* ...  Pg" 
(each  a^O);  in  ft(x),  m  runs  over  all  such  values  which  are,  in  addition, 
greater  than  x*.  Now 


a(x)=  f[  (1-"-  +  ^-^  +  .. 

r=l\         Pr       Pr       Pr 


^-      __  . 
4  log  log  a; 

Again, 


(15) 


(16) 


q 


where  q  runs  over  all  positive  integers  (  >xi),  arid  F(q)  is  as  in  lemma  4. 
By  partial  summation,  from  (16),  using  F(y)  <  y*t  we  have 


P(x)  =  O((a?*)*  (a;*)"1)  =  O(^-^)  (17) 

by  lemma  4. 

From  (14),  (15),  (16),  (17)  we  get 


v~_-_  (18) 

m^tf    m         4logloga; 

11.   From  (13)  and  (18)  we  get 

v       Z±(*?^\^-^L_ 
x<n^<2xm±i™>\     m     f       41oglogo;* 

Now,  from  the  reciprocity  theorem  for  Jacobi's  symbol,  we  have 

m 

(20) 


if  m  is  odd;  and,  by  definition, 


if  m  is  even.  It  now  follows,  from  (19)  and  lemma  3,  that  there  exists  an 
integer  n  with  x  <  n  ^  2#  and  such  that  Saw  +  b  is  quadratfrei,  and  further 


4         logloga;' 
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Now        y  l 


San  4-  b) 

(22) 

-J  =  —  1  [using  6  =  5  (mod8)].  From  (21)  and  (22)  we  get  the 
following  result: 

THEOREM  I.  For  x  >  x0,  there  exists  a  positive  integer  n  satisfying 
(i)  x  <  n  ^  2x, 
(ii)   San-t-b  is  quadratfrei,  and 

,....  "     1  /      m      \       TTZ     c       (l+o(l)} 

m^i  m  \San-\-b)       6         log  log  (San  -4-6)' 

since  log  log  (San  4-  6)  ~  log  log  x. 

Again,  since  I .- 1  is  a  real  primitive  character  (mod  (San  -t-  6)),  when 

&  \San  +  67 

?n  runs  through  all  positive  integral  values  (because  San  +  b  is  quadratfrei) 
we  can  "write  theorem  1  as  follows : 

THEOREM  2.  There  exist  infinitely  many  Jc  such  that 


,—^, 
/J  6         log  log  A; 

where  x(n)  denotes  a  real  primitive  character  (mod&).   In  fact  such  a  k  exists 
between  x  and  2x  for  all  large  x. 
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THE  GEOMETRY  OF  THE  BINARY  (3,  1)  FORM 
By  J.  A.  TODD 

[Received  and  read  16  January  1947] 

1.  The  object  of  this  paper  is  to  interpret  geometrically  the  20  con- 
comitants of  the  binary  (3, 1)  form  which,  as  I  have  recently  shown,*  con- 
stitute the  complete  irreducible  system  for  the  form. 

A  binary  (m,  n)  form  a™a,y  in  two  sets  of  binary  variables  (x0,  xj  and  (yQ,  y^) 
may  be  interpreted  geometrically  in  the  following  manner.  |  If  we  put 

X  =  x0yQ,     Y  =  x0yly     Z  =  x^yQ)     T  =  x^y^  (1) 

and  interpret  (X,  Y,  Z,  T)  as  homogeneous  coordinates  in  space,  then  the 
equations  (1)  are  the  parametric  equations  of  the  quadric  surface  Q  whose 
equationis  XT-YZ  =  Q.  (2) 

Those  points  of  Q  for  which  the  ratio  #0 :  x±  is  fixed  lie  on  a  generator  of  one 
system,  (x)  say,  on  Q,  while  those  points  for  which  the  ratio  y$  :y±  is  fixed 
lie  on  a  generator  of  the  opposite  system  (y).  If/=a™aJ  is  a  binary  (m,n) 
form,  then  the  equation/  =  0  determines  an  (m,  n)  correspondence  between 
the  two  systems  of  generators  on  Q,  and  the  locus  of  points  of  intersection 
of  corresponding  generators  is  a  curve  on  Q,  of  order  m  -f  n,  meeting  the 
generators  of  the  systems  (x),  (y)  in  n,  m  points  respectively.  That  is,  the 
points  represented  by  the  vanishing  of  an  (m,  n)  form  lie  on  an  (n,  m) 
curve  on  Q. 

In  the  case  of  a  (3, 1)  form,  with  which  we  are  immediately  concerned,  the 
curve  C  which  represents  the  form  is  a  rational  quartic  curve  having  the 
generators  (y)  as  trisecants  and  meeting  each  of  the  generators  (x)  in  a 
single  point.  The  geometrical  properties  of  such  a  curve  have  been  inten- 
sively studied,  J  and  we  shall  recall,  in  the  next  section,  the  facts  which  are 
relevant  to  the  present  account. 


*  Todd,  Proc.  Camb.  Phil.  Soc.  42  (1946),  196-205. 

t  This  representation  has  been  applied  to  the  (2,  2)  form  by  Turnbull,  Proc.  Roy. 
Soc.  Edinburgh,  44  (1924),  23-50. 

%  A  detailed  account  is  given  by  Telling,  The  rational  quartic  curve  in  space  of 
three  and  four  dimensions  (Cambridge  Tracts  in  Mathematics  and  Mathematical 
Physics,  no.  34,  1936),  where  proofs  of  the  statements  made  below  will  be  found. 
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2.  A  non-singular  rational  quartic  curve  C  in  space  lies  on  a  unique 
quadric  surface  Q,  and  has  one  system  (y)  of  generators  of  Q  as  trisecant 
lines.  The  coordinates  of  a  variable  point  on  C  are  expressible  as  homo- 
geneous binary  quartic  polynomials  in  a  pair  of  parameters  (£0,  ^),  and  the 
four  points  of  C  which  lie  in  a  plane  are  represented  by  the  vanishing  of  a 
quartic  form  in  (tQ,  y  which,  as  the  plane  varies,  remains  apolar  to  a  fixed 
quartic  <j><  The  four  points  of  C  for  which  (j>  =  0  are  the  points  of  contact  of 
C  with  its  four  stationary  osculating  planes.  The  two  invariants  /  and  J 
of  the  binary  quartic  <f>  vanish,  respectively,  if  the  parameters  of  these  four 
points  form  an  equianharmonic  or  a  harmonic  set.  The  four  points  of  C 
represented  by  the  vanishing  of  the  hessian  H  of  <j>  are  the  double  points  of 
the  g\  (involution  of  triads  of  points)  traced  on  C  by  its  trisecants,  that  is, 
they  are  the  points  of  contact  of  C  with  the  four  tangents  which  meet  C  again. 
The  six  points  on  C  represented  by  the  vanishing  of  the  sextic  covariant 
of  (j>  form  three  mutually  harmonic  pairs  of  points;  the  three  chords  joining 
these  pairs  of  points  are  concurrent  in  a  point  O,  and  their  three  polar  lines 
with  respect  to  Q  lie  in  the  polar  plane  TT  of  O  with  respect  to  Q.  These 
three  chords,  and  their  polar  lines,  form  the  edges  of  a  tetrahedron  with 
respect  to  which  Q  is  self-polar,  and  the  curve  C  is  invariant  under  the 
three  harmonic  inversions  with  respect  to  pairs  of  opposite  edges  of  the 
tetrahedron. 

The  planes  which  meet  C  in  equianharmonic  tetrads  of  points  envelop 
a  quadric  S,  and  the  planes  which  meet  C  in  harmonic  tetrads  envelop  a 
Steiner  surface  k  which  has  O  as  a  triple  point  and  the  three  chords  of  C 
which  pass  through  O  as  double  lines;  and  the  curve  C  is  an  asymptotic 
line  on  this  surface. 

3.  It  is  well-known  that  any  general  pencil  of  binary  cubics  can  be 
regarded  as  the  pencil  of  first  polars  of  a  binary  quartic.*  Hence,  by  a  linear 
transformation  of  the  variables  (yQ,yi)  the  (3, 1)  form  a\&v  can  be  written 
in  the  form  g. 

- 


*  I  find  no  reference  for  this,  though  the  result  is  familiar.  A  simple  proof  is  as 
follows.  Let  r  and  s  be  two  binary  cubics  in  (xot  xj.  Then  the  partial  derivatives 
rf,  8{  of  r  and  3  with  respect  to  xt  (i  —  0,  1)  are  quadratics.  Since  any  four  binary 
quadratics  are  linearly  dependent  there  exist  constants  Af,  pt  such  that 


Thus,  if  JR  =  A0r-fA15,  S  =  fi0r  +  /*!«,  dR/dx0  —  dS/dx^.  Hence  there  is  a  binary 
quartic  i/r  such  that  di/r/dxQ  =  S,  di/r/dx^  —  R.  The  pencil  of  first  polars  of  ft  is  the 
pencil  defined  by  R  and  S,  and  hence  these  polars  all  depend  linearly  on  r  and  s. 
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where  ^  is  a  binary  quartic  in  (#0,  a^).  In  general,  by  a  linear  transformation 
of  the  variables  (x0,  «x),  we  can  arrange  that  i/r  takes  the  form 


and  so  we  may  take  our  binary  (3,  1)  form  to  be 

f^ax30y0+^bx^x1y1-{-3bx0xlyQ  +  axly1.  (3) 

This  is,  essentially,  a  canonical  form*  for  the  binary  (3,  1)  form.  The  con- 
comitants of  the  form  (3)  have  been  given  explicitly  at  the  end  of  the  author's 
paper  already  quoted.  It  will  not  be  necessary  to  reproduce  the  entire  list 
here,  but  it  is  desirable  to  give  a  resume  of  the  20  forms  of  the  complete 
system.  They  are  given  in  the  following  table,  in  which  an  (m,n)  form  is 
indicated  by  the  symbol  (ra,n)  following  its  definition  in  terms  of  trans- 
vectants  : 

Degree  Forms 

1          (M). 


2  F~(fJ)^  (4,0),     /^(/>/)2'°  (2,2),     Ias(/,/)M  (0,0). 

3  j=(f,Fy-»  (5,1),     ls(/,A)i.o  (3,3),     p=(f,F)*.*  (1,1). 

4  (f,p)°-L  (4,0),     (/,j?)i.o  (2,2),     A=(M)2'°  (0,4). 

5  (h.p)^1  (3,1),     (h,pY>*  (1,3). 

6  0»o-i  (6,0),     (t,p)°>i  (4,2),     (l,pF-°  (2,4),     I6^(p,p)l>1  (0,0). 

7  (/,^2)2'0  (1,3). 

8  (h,p*)z><>  (0,4). 

9  (t,^)*0  (1,5). 
12  (t,p*)*'«  (0,6). 

It  will  be  observed  that  the  bilinear  form  p  occupies  a  prominent  position 
in  the  scheme,  since  12  of  the  20  concomitants  are  expressed  as  trans- 
vectants  of  a  power  of  p  with  some  other  form. 

4.    We  now  represent  the  (3,  1)  form  (3)  by  means  of  a  (1,  3)  curve  on  the 
quadric  XT—  YZ  =  0  in  the  manner  explained  in  §  1.  From  (3)  we  see  that, 


From  this  it  is  easily  seen  that,  if/  =  0,  the  point 
(X,  7,  Z,  T)  = 


*  Precisely,  it  becomes  canonical  if  we  divide  by  a  and  put  b/a  =  m.  The  yse  of 
tho  homogeneous  form  (3),  however,  has  the  advantage  that  the  degree  of  any  con- 
comitant can  be  seen  by  inspection. 
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lies  on  the  curve  C  whose  parametric  equations  are 


where  tQ:t1  =  XQ  :  x±.  (5) 

The  binary  quartics  X,  Y,  Z,  T  defined  in  (4),  and  any  linear  combination 
of  them,  are  all  apolar  to  the  quartic 

<t>  =  bt*-2atlt\  +  btl  (6) 

and  accordingly  the  four  stationary  osculating  planes  of  C  have  their  points 
of  osculation  given  by  <j>  =  0.  The  invariants  /  and  J  of  this  quartic  are 

/  =  bz  +  %az,     J  =  - 
and  the  discriminant  of  0  is 


Now  the  invariants  72  and  J6  of  the  form  /  are  given  by 
72  =  2(«2  +  362),     /6  =  1862(a2-62)2. 

Hence  /2  vanishes  if  the  fundamental  quartic  <j>  is  equianharmonic,  and  J6 
vanishes  if  <j>  has  a  repeated  factor.  *  When  this  is  the  case,  the  curve  C  has 
a  stationary  tangent  line.f 
The  hessian  of  (f>  is 

H=-l  [abt*  +  (a2  -  362)  *2  1\  +  abtfi  . 
Since  the  (4,  0)  form  F  is  given  by 
F  = 


we  see  from  (5)  that  F  —  0  represents  the  four  generators  (#)  of  Q  whose 
intersections  with  C  are  given  by  H  =  0,  and  we  have  seen  in  §  2  that  these 
points  are  the  double  points  of  the  g\  cut  on  C  by  its  trisecants. 

The  sextic  covariant  of  ^  is  a  multiple  of  £0M^o  ~  ^i)>  an<^  ^ne  ^nree  pairs 
of  mutually  harmonic  points  on  C  represented  by  the  vanishing  of  this 
covariant  are  given  by  the  pairs  of  values  oo,0;  1,  —  1;  i,  —i  of  ^0/^.  The 
coordinates  of  these  three  pairs  of  points  are  thus 

(0,1,0,0),  (0,0,1,0);     (-1,1,  -1,1),   (1,1,  -1,-1); 

(i,  -1,M),  (-i,  -1,1,  -i), 

*  The  canonical  form  is  no  longer  valid  in  this  case;  in  fact  the  form  (6)  has  a 
repeated  factor  only  when  it  is  a  perfect  square. 
f  Telling,  loo.  cit.  ante,  69. 
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and  the  three  lines  joining  the  pairs  are 


which  meet  in  the  point  O  whose  coordinates  are  (0,  1,  —  1,  0).  The  polar 
plane  n  of  O  with  respect  to  Q  is  thus  the  plane  Y  —  Z  =  0,  and  this  meets 
Q  in  points  of  a  conic  expressed  by  the  vanishing  of  the  form 


Reference  to  the  list  of  concomitants  of  the  form  (3)  in  the  author's  paper 
already  quoted  shows  that 


Hence  the  section  of  Q  by  the  plane  n  is  given  by  p  =  0.  We  observe,  more- 
over, that  since  76  ==  (p,  p)l>1  the  vanishing  of  /6  is  the  condition  that  p  should 
be  the  product  of  linear  factors  in  (#0,  x^)  and  (?/0,  y^  respectively,  that  is, 
it  is  the  condition  that  TT  touches  Q. 

Moreover,  if  g  is  any  (m,  n)  form,  the  transvectant  (<7,^A)A>0  (for 
is  simply  a  multiple  of 


It  therefore  admits  the  following  interpretation.  On  any  generator  of  the 
system  (y),  the  form  g  determines  a  set  of  m  points.  The  generator  meets  n 
in  a  point  P  for  which  #0  :  xt  =  ?/0  :  yv  The  form  (g,p*)*'°  determines,  on  this 
particular  generator,  the  Ath  polar  group  of  P  with  respect  to  the  m  points 
on  the  generator  given  by  g  =  0.  Hence  (<7,j»A)A>0  represents  a  curve  on  Q 
which  is  the  locus  of  these  polar  groups  as  the  generator  varies  in  the 
system  (y).  A  precisely  similar  interpretation  of  the  form  (</,£>A)°'A  (A^ti)  is 
obtained  by  interchanging  the  parts  played  by  the  two  systems  of  generators. 

Thus  all  the  forms  of  our  system  will  have  been  accounted  for  if  we  inter- 
pret those  forms  of  the  system  whose  definition  does  not  involve  p.  Since 
/  itself,  the  form  F,  and  the  invariants  have  already  been  considered,  the 
forms  which  remain  are  h,  t,j  and  A. 

Now  h,  t,  A  are  respectively  the  hessian,  cubic  co  variant,  and  discriminant 
of/  regarded  merely  as  a  binary  cubic  in  (x0tx±).  Thus,  on  any  generator 
of  the  system  (y),  the  two  points  given  by  h  =  0  are  the  hessian  pair  of  the 
three  intersections  of  the  generator  with  O,  and  the  equation  h  =  0  deter- 
mines a  (2,  2)  curve  on  the  quadric  which  is  the  locus  of  these  points  as  the 
generator  (y)  varies.  The  interpretation  of  the  form  t  is  exactly  similar.  And 
the  form  A,  equated  to  zero,  clearly  represents  the  four  generators  of  the 
system  (y)  which  touch  C.  The  form  j  is  the  Jacobian  of  F  and/,  regarded 
as  binary  forms  in  (a?0,  o^);  and  can  thus  be  interpreted  in  a  similar  way. 
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5.  All  the  20  forms  of  our  system  are  thus  accounted  for,  though  the 
geometrical  significance  of  some  of  the  forms  is  not  very  simple.  This  is 
perhaps  natural  since,  particularly  in  the  case  of  forms  of  high  degree,  the 
form  selected  as  a  member  of  the  complete  system  off  may  be  one  among 
many  other  forms  of  the  same  degree  and  orders  in  the  variables  which  differ 
from  each  other  by  reducible  forms,  and  the  actual  selection  of  a  particular 
member  of  such  a  set  of  equivalent  forms  has  been  made  with  a  view  to 
algebraic  convenience  and  has  no  clear  geometrical  significance. 

One  feature  of  the  system  deserves  notice.  An  inspection  of  the  list  of 
forms  shows  that,  in  the  complete  system  of/,  (m^ri)  and  (n,  m)  forms  are 
equally  numerous,  a  fact  which  is  not  a  priori  evident  since  /  itself  is  not 
symmetrical  with  respect  to  its  two  sets  of  variables.  The  explanation  of 
this  fact  is  the  following.  The  quadric  Q  is  unaltered  by  the  harmonic  in- 
version with  respect  to  O  and  its  polar  plane  TT,  and  this  transformation 
interchanges  the  two  systems  of  generators  on  Q  in  such  a  way  that  corre- 
sponding generators  meet  on  TT.  Thus  any  (m,n)  form  represents  a  curve 
Ci  on  Q  which,  under  this  transformation,  gives  rise  to  a  curve  C2  repre- 
sented by  an  (n,  m)  form,  one  of  these  forms  (to  within  a  constant  factor) 
being  obtained  from  the  other  by  interchange  of  (x0,xt)  with  (2/0>2/i)>  &nd 
the  two  curves  being  in  perspective  from  O.  Reference  to  the  table  of 
explicit  forms  shows  that  many  of  the  concomitants  in  our  list  are  paired 
off  in  this  way,  and  it  is  not  difficult  in  those  cases  where  this  symmetry 
with  respect  to  the  variables  does  not  exist  to  obtain  it  by  replacing  one  or 
other  of  the  forms  by  an  equivalent  form  differing  from  it  only  by  reducible 
terms.  To  take  just  one  instance;  there  are  two  (4,0)  forms'^  and  (f,p)0>l, 
and  two  (0,  4)  forms  A  and  (h,p2)2>°.  Of  these,  F  and  (h,p2)2'®  are  obtained 
from  each  other  by  interchange  of  the  two  sets  of  variables,  but  (/,  p)Q>  L  and 
A  are  not  related  in  this  manner.  But  it  is  easily  seen  that  if  (f,p)0'1  is 
replaced  by  the  equivalent  form  —  $(f,p)°'*  —  \I%F  the  desired  symmetry 
appears. 

6.  We  conclude  by  considering  the  curves  cut  on  Q  by  the  quadric  S 
and  the  Steiner  surface  &,  enveloped  by  the  planes  which  meet  C  in  equi- 
anharmonic  or  harmonic  sets.    It  is  convenient  to  make  the  change  of 
coordinate  system  in  space  defined  by 


,     T=Y-Z.  (7) 

The  parametric  equations  of  C  are  then 

£  =  «(#-£),  y  =  -(a  +  36)  (^!  +  Mi)» 

( 
£=  (a-  36)  («!!*!-  <o#),     r  =  a#  +  66«g*l  +  a«}, 

CC2 
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and  the  plane  Z£  +  my  +  n£  +pr  =  0  meets  C  in  the  four  points  given  by  the 
vanishing  of  the  form 


li.     (9) 

The  condition  that  (9)  be  equianharmonie  is  that  its  invariant  of  the  second 
degree  vanishes,  a  condition  which  reduces  to 

-4a2Z2-(a+36)2m2-f-(a-36)2^2  +  4(a2  +  362):p2  =  0, 
which  is  the  tangential  equation  of  S.  The  point  equation  of  S  is  therefore 

(a2  -  962)2  [a2r2  -  (a2  +  362)  £2]  +  4a2(a2  +  362) 

x[(a  +  3&)2f2-(a-36)V]  =  0.     (10) 
On  putting 


this  reduces  to 

-  362(a2  -  9/,2)2  (v*y\  +  *2l2 

+  6(a2  -  62)  (2a4  +  45a262  +  816*)  ^0^i2/o2/i-     (12) 

This  must  be  a  covariant  (2,2)  form  associated  with  /  (to  within  a  factor 
involving  a  and  6).  The  only  (2,  2)  forms  in  the  system  are  h,  (/,p)1>0  and^>2, 
of  respective  degrees  2,  4  and  6.  And  it  is  easily  verified,  on  trying  to  express 
(12)  as  a  linear  combination  of  these  forms,  that  the  coefficients  in  the  linear 
combination  are  in  fact  proportional  to  invariants  of/,  and  that  the  inter- 
section of  Q  and  S  is  given  by 

Ith  +  4lz(f,pF><>-2p*  =  0.  (13) 

The  condition  that  (9)  be  a  harmonic  quartic  is  expressed  by  the  vanishing 
of  its  catalecticant.  After  some  reduction,  this  gives 

2a(a2  -  962)  Imn  +p[8azbl*  +  (a  +  36)2  (a  -  b)  m*  +  (a  -  36)2  (a  +  6)  w2 

-86(a2-62)j92]  =  0,    (14) 

which  is  the  tangential  equation  of  the  surface  k.  The  point  equation  of 
this  surface  is 


K  SEE  8a26?;2£2  +  (a  +  36)2  (a  -  6)  £2£2  +  (a  -  36)2  (a  +  6)  ^ 

+  2«(a2-962)g^r  =  0.     (16) 

The  simplest  proof  of  this  is  a  verification  as  follows.  The  surface  (15) 
admits  the  parametric  representation 

£  =  2auv,  ?}  —  —  (a  4-  36)  uw, 

£  =  (a  —  36)  vw,     T  • 
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and  the  coordinates  (l,m,n,p)  of  the  tangent  plane  to  the  surface  at  the 
point  whose  parameters  are  u,  v,  w  are  given  by  substituting  from  (16)  in 
d<K"/9£,  .  .  .  ,  dK/dr.  After  a  little  reduction  and  removal  of  a  common  factor 
2a(a2  -  962)  uvw  we  find 


m  =  2a(a-3b)[-(a  +  b)u2+(a-b)v2-2bw2]v, 
n  =  2a(a  +  3b)[ 
p  —  —  2a(a2  —  9 

and  these  values  of  I,  m,  n,  p  are  found  to  satisfy  (14)  identically  in  u,  v,  w. 
When  we  substitute  from  (11)  in  (15),  we  find  that  the  intersection  of  K 
with  Q  is  given  by  the  vanishing  of  the  (4,  4)  form 

l  -  x2y2)2  +  (a  +  36)2  (a  - 
+  (a  -  36)2  (a  +  6)  (x^ 

+  2a(a2-  962)  (*%yl-x*y*)  (x20y2-x2y2). 


This  form  contains  /  as  a  factor  (as  it  should,  since  C  lies  on  K),  and  the 
remaining  factor  proves  to  be  a  multiple  of 


which,  from  the  table  of  explicit  forms,  we  identify  with  (h,p)l>0.   Hence 
the  residual  intersection  of  K  and  Q  is  given  by 

(h,p)l-o  =  0. 


Trinity  College 
Cambridge 
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THREE-DIMENSIONAL  SPACES  OF  RECURRENT  CURVATURE 

By  H.  S.  RUSE 
[Received  3  February  1947 — Read  20  February  1947] 

1 .    A  Riemannian  Vn  (of  fundamental  tensor  gi:f)  for  which  the  covariant 
derivative  of  the  curvature  tensor  satisfies  the  relation 


at'  all  points,  for  some  vector-field  KP,  will  be  called  a  space  of  recurrent 
curvature,  or,  briefly,  a  recurrent  space,  and  will  be  denoted  by  Kn.  Such 
spaces  have  hitherto  been  called  by  the  non-descriptive  name  kappa  spaces 
(Ruse  (6)).  The  new  name  is  chosen  to  emphasize  the  property  that  defines 
them,  namely,  that  the  curvature  tensor  recurs,  so  to  speak,  under  co- 
variant differentiation.  (!•!)  will  be  called  the  recurrence-condition. 
Every  V2  is  a  Kz  (Ruse  (5)),  the  vector  KP  being  given  in  a  V2  by 


where  K  is  the  Gaussian  curvature  J?1212/gr.  If,  therefore, 


defines  the  metric  of  a  V2,  and  if  dl  is  the  line-element  of  an  En_2  (i.e.  of  a 
flat  Vn_%)  of  variables  #3,  #4,  ...,  xn,  then  the  product-space  defined  by 

ds2  =  dsl  +  dl*  (1-3) 

is  a  Kn  with  KP  a  function  of  xl,  x2  only.  This  is  readily  verifiable,  the  Riemann 
tensor  for  (1-3)  having  essentially  only  the  one  component  -/2m2-  A  Kn 
defined  in  this  way  will  be  called  a  flat  extension  of  the  Vz.  More  generally, 
the  product  Vr  .  En_r  of  any  Vr  and  an  En_r  will  be  called  a  flat  extension  of  Vr. 

Spaces  JC4  that  also  have  the  property  of  being  simply  harmonic  have 
appeared  in  the  papers  quoted. 

Recurrent  spaces  for  which  ^  =  0  are  symmetric  in  Cartan's  sense 
(Cartan  (I,  2)).  Their  properties  are  well  known. 
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The  present  paper,  which  is  confined  to  the  case  n  —  3,  is  intended  as 
a  further  step  towards  determining  the  nature  of  recurrent  spaces  in  general. 
It  is  assumed  throughout  (except  for  a  brief  further  reference  to  symmetric 
spaces  in  §  9)  that  Kp  is  not  identically  zero,  which  excludes,  incidentally, 
the  case  when  V3  is  of  constant  (non-zero)  curvature.  It  is  also  assumed 
that  V3  is  not  flat.  In  this  paper,  therefore,  a  K%  will  mean  a  non-flat  Vs  in 
which  (1-1)  holds  for  some  KP  not  identically  zero. 

The  main  conclusion  of  the  paper,  reached  in  §  6,  is  that  a  necessary  and 
sufficient  condition  for  a  V3  to  be  a  K%  is  that  it  should  admit  afield  of  parallel 
vectors,  which  may  be  either  non-null  or  null, 

If  the  curvature-scalar  R  is  non-zero,  then  multiplication  of  (1-1)  by 

ghkgij  at  once  gives  « 


It  will  be  seen  below  that  KP  is  likewise  the  gradient  of  a  scalar  even  when  H 
is  everywhere  zero. 

2.    Hereafter,  then,  it  is  assumed  that  n  =  3  and  that  the  K,  under  dis- 
cussion is  a  K3.  The  Bianchi  identity  gives,  by  (1-1), 


This  may  be  written  emn8RhimnKa  =  °> 

emns  being  the  dualizing  tensor  of  components  (  ±  1/^/gr,  0),  and  so  imaginary 
if  g  <  0.  Multiplying  this  by  ^ehir,  we  get 

Qr8Ks  =  0,  (2-2) 

where  Qra  = 


is  the  dual  of  the  Riemann  tensor  for  a  TCj. 

In  the  projective  space  $2  associated  with  any  given  point  (x1)  of  Ks,  the 
tensor  Qrs  represents  the  Riemann  conic-envelope  (Ruse  (4),  §  3)  of  equation 
QT3PrPs  =  0  in  current  line  -coordinates  pi3  and  (2-2)  states  that  it  is  degen- 
erate, consisting  of  a  pair  of  points,  I1,  mi,  say,  on  the  line  Kt.  In  K%  we  can 
therefore  find  vectors  I1,  m{  such  that 


and  therefore  such  that         Q**  ==  ^Pm*  +  m*V). 

Hence  Rhijk  s  \(lMmik  +  mhiljk),  (2-3) 

where  ^  =  eiiklk,       mtj  -  e^fcmfc,    ] 
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In  $3,  lti  and  m^  are  dual  coordinates  of  the  points  I1,  mi.  These  points  lie 
on  the  line  K^  Let  Ai  be  any  line  through  I1  other  than  K^  Then,  with  a  proper 
normalization*  of  the  components  of  A^,  we  have 

*«-^K,-*«A/.l  (2.5) 

and  similarly  mti  ~  ^KJ  —  Kiflp) 

/it  being  a  line  through  mi. 

The  geometrical  configuration  in  $2  therefore1  consists  of  three  lines 
forming  a  triangle  LMN.  L  is  the  point  I1  (or  l^)  and  M  the  point  m*  (or  m^.) . 
LM  is  the  line  Kif  and  LN,  MN  the  lines  Ae-,  /it  respectively.  Let  the  co- 
ordinates of  N  be  n1  (or,  dually,  nit).  Then  since  it  is  the  intersection  of  the 
lines  A^,  JLL{ ,  we  have,  with  a  proper  normalization  of  its  components, 

nu  =  A^-/^A,.  (2-6) 

The  plane  £2  also  contains  the  fundamental  conic  of  point-  and  tangential- 
equations  qi^XiXi  —  0  and  g^pipj  =  0  respectively.  The  raising  and 
lowering  of  suffixes  by  means  of  the  fundamental  tensor  corresponds  to 
taking  polars  with  respect  to  the  conic. 

The  various  incidences  are  expressed  analytically  by 

(AJ)  =  0  =  (An)  =  (Kl)  =  (Km)  -  (pm)  =  (/m),  (2-7) 

where  (XI)  denotes  A^*,  and  so  on. 

3.   Differentiating  (!•!)  covariantly,  we  obtain 

Rhijk,p,q  ~   (Kp,q  +  KpKq)Rhijk- 

Interchanging  p,  q  and  using  the  permutation-formula  for  co variant  deri- 
vatives (Eisenhart  (3),  30),  we  obtain  a  relation  that  may  be  written 

Rri jk  I?,  hpq  ~  Rrhjk  &  ipq  +  ^rkhi  &.  jpq  ~  ^rjhi  &.  kpq  ~  (Kp,  q  ~  Kq,  p  )  ^hijk '       (3*1) 

The  second  term  on  the  left-hand  side  is  minus  the  first  with  h,  i  inter- 
changed, and  the  last  two  are  the  same  as  the  first  two  with  the  pairs  (hi),  (jk) 
interchanged.  From  this  it  quickly  follows,  by  substituting  from  (2-3)  in 
(3-1),  that 

)»  W  interchanged} 

jk  +  mhiljk)*     (3'2) 

where  (K/I)  =  K^,  etc.  Taking  the  .dual  in  the  manner  indicated  in  (2-4), 
we  get 

{(KfL)  I1  -  (/cA)  mi  -  (KK)  n1}  (m>lk  -  Vmk) 

+  {(K/I)  V  -  (/cA)  m?  -  (KK)  ri}  (mHk  -  Z'mfc) 

=  2^(^m?'4-m^)>     (3-3) 
where  vk  =  te**«(KM  -  *«.„). 

*  I.e.  choice  of  the  factor  of  proportionality. 
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Let  ai  be  an  arbitrary  line  in  S2  through  the  point  I1,  so  that  ((rl)  —  0. 
Multiply  (3-3)  by  er^cr,.,  summing  for  i,j.  We  then  get 

2{  -  (/cA)  (orm)  -  (KK)  (cm)}  (<rm)  lk  =  0, 

and  hence*  either     (a)     (vm)  =  0 

or  (6)     (/cA)  (crm)  +  (KK)  (an)  =  0 

for  all  lines  or  such  that  (crl)  =  0. 

In  case  (a)  we  have  cr^ra1  =  0  whenever  cr^  =  0,  and  therefore 

/*  =  Om1,     i.e.     ltj  —  Om^, 

0  being  a  scalar.  The  points  l\  mi  in  $2  coincide.  Substituting  in  (2-3)  and 
absorbing  the  scalar  ^0  into  mip  we  deduce  that  the  Riemann  tensor  must 
have  the  form 

RMik  =  WhiMjk'  (3'4) 

where  mhi  is  a  bivector. 

In  case  (b)  it  follows  similarly  that 

(/•cA)  mi  +  (KK)  n*  =  (f>l\  (3-5) 

<j>  being  a  scalar.  If  (KK)  —  0,  then  we  deduce  once  again  that  the  points 
^,  m^  coincide,  and  (3-4)  follows.  If  (*7c)=t=0,  then,  multiplying  (3-5)  by 
Kt  and  using  (2-7),  we  get 

(KK)  (Kn)  =  0, 

and  therefore  (Kn)  —  0.  So  the  point  n*  lies  on  the  line  Kt,  and  the  points 
^,  m*  are  coincident  yet  again.  Hence  in  all  cases  the  Riemann  tensor 
reduces  to  the  form  (3-4).  Also  from  (3-2),  with  ii}  —  m^,  we  now  obtain 


and  therefore,  because  m^  is  not  zero  except  in  the  excluded  case  of  a  flat 
space, 

*jMr-*fl.P  =  °- 

Hence  KP  is  the  gradient  of  a  scalar  K.  It  will  still  be  denoted  by  KP,  though 
consistency  of  notation  now  really  requires  it  to  be  denoted  by  K  tp. 

The  figure  in  $2  now  consists  of  two  lines  Kit  fa  meeting  in  the  point  M  of 
coordinates  ra*  (dually  m^),  together  with  the  fundamental  conic.  The  tri- 
angle LMN  has  shrunk  into  the  point  M  . 


*  The  possibility  lk  —  0  is  excluded,  because,  by  (2-3),  this  would  lead  to  the  case 
of  a  flat  space. 
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4.    Thus  in  every  K3,  the  Riemann  tensor  has  the  form 


where  m^  =  faK^  —  Ki^.  (4*2) 


If  we  substitute  from  (4-1)  in  the  recurrence-condition  (1-1)  and  multiply 
the  resulting  equation  by  skl,  where  skl  is  any  bivector  such  that  sklmkl  =|=  0, 
we  quickly  obtain  (cf.  Ruse  (6),  equations  (3-9),  (3-  10)) 


Let  rif  =  e-*Kmip  (4-4) 

so  that,  by  (4-1),  Rhm  =  e*rhirjk.  (4-5) 

In  $2,  rtj  represents  the  same  point  M  as  ra^,  and  we  write  as  usual 

ri  =  \^rjk.  (4-6) 

By  (4-4)  and  (4-3),       rijf  p  =  e-**(mt,tp  -  ^Kpmtj) 

=  0, 
and  hence,  by  (4-6),  r\p  =  0.  (4-7) 

Thus  any  K^  admits  a  field  of  parallel  vectors  r*,  the  Riemann  tensor  being 
expressed  in  terms  of  the  dual  ofr1  by  (4-5). 

5.  If  ri  is  not  a  null  vector  -field,  then  the  metric  of  K3  is  reducible  to 
the  form 

ds2  =  2  a^dx^dxP  ±  (dx9)2,  (5-1) 

1,2 

where  the  aa/s  are  independent  of  a:3  (Eisenhart  (3),  71).  This,  by  (1-3),  is 
already  known  to  define  the  metric  of  a  recurrent  space,  and  so  we  have  a 
partial  converse  to  the  above  theorem,  namely:  If  a  V3  admits  a  field  of 
non-null  parallel  vectors,  it  is  a  Ks.  Such  a  K3  is  a  flat  extension  of  a  K2, 
being  the  product  of  a  K2  and  an  E±. 

6.  Multiply  (4-5)  by  ghkff**.  We  get 

R  =  - 


Therefore  R  =  0  if  rprp  =  0,  and  conversely.  Thus  the  curvature-scalar  R 
for  a  K3  is  zero  if,  and  only  if,  rl  is  a  null  vector  -field. 
Assume  this  now  to  be  the  case.  Then 

r^ss^w  =  0.  (6-1) 
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Also  by  (4-7),  riip  =  0  =  rPti, 

and  rt  is  therefore  the  gradient  of  a  scalar  r.  Thus 

"-£•  <"'> 

From  (4-7)  it  also  follows  that  the  equation 


for  /  is  completely  integrable  and  so  admits  two  independent  solutions,  of 
which  one,  by  (6-1),  may  be  taken  to  be  r  itself.  Let  u  be  another  solution, 
independent  of  r.  Then  ~ 

r*—  =  0.  (6-3) 

Further,  let  v  be  a  solution  of  the  equation 

*£-.  ««> 

v  is  then  determinate  but  for  the  addition  of  an  arbitrary  function  of  r 
and  u.  Moreover,  the  functions  r,  u,  v  are  independent  because  r  and  u 

were  chosen  so,  and  v  —  F(r,u)  implies  *"*«— %  =  0,  in  contradiction  to  (6-4). 
Therefore  coordinates  may  be  chosen  so  that 

x1  =  u,     x2  =  r,     xz  —  v. 

In  terms  of  these,  ri  =  <£f,  (6-5) 

by  (6-2).  Therefore,  by  (6-1),  (6-3),  (6-4), 

02i  =  o  =  gr22,     023  =  1,  (6-6) 

whence,  taking  the  normalized  cofactors, 

!/31  =  t)  =  <73g,       grgs  s==    1-  (6 *7) 

Also  the  determinant  g  has  the  value 
Moreover,  0  =  r^         by  (4*7) 

(21        x,       to       x 

=  —{..}     by  (6-5), 

lyj 

whence,  by  (6-6),  [ij,  3]  =  0, 

and  therefore,  by  (6-7),  -~~  =  0. 

ox 
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Hence  the  g^  are  functions  ofx1,  x2  only.  Combining  this  with  (6-7),  we  have : 
The  metric  of  any  V3  that  admits  a  field  of  parallel  null  vectors  is  reducible  to 
the  form  d^  =  ^  dx«dxp  +  2dx*dx*,  (6-9) 

1,2 

where  the  aa/?  are  functions  of  xl,  x2  only.* 

Conversely,  any  V3  of  the  form  (6-9)  is  a  KB.  For,  as  is  easy  to  show,  the 
only  non-zero  components  of  the  curvature  tensor  of  (6-9)  are  those  related 
^°  ^i2i2>  while  ap  »  *_ 

1212 


by  (6-8).  Thus  if  we  write      KP  =  •= — log 


fii*i«  (6.10) 


(cf.  (1-2)),  we  have  #1212,^  =  ^^ma- 

Further,  for  all  components  other  than  those  related  to  #12i2> 

Rfiijk.p  —  0  =  KpRhi.jk, 

so  the  curvature  tensor  of  (6-9)  satisfies  the  recurrence-condition. 

Therefore  every  Vz  of  the  form  (6-9)  is  a  Ks.  Combining  this  result  with 
the  theorems  stated  in  §§  4,  5,  we  obtain  the  general  theorem  enunciated 
in§l. 

The  KZ  defined  by  (6-9)  is  a  null  extension  (Walker  (7))  of  the  V2  (i.e.  K%) 
defined  by  ds2  =  ^aa,dx-d^, 

1,2 

and  it  has  therefore  been  shown  that  every  K9  is  either  a  flat  extension  or 
a  null  extension  of  a  K%. 

1.  Apart  from  certain  changes  of  notation  and  presentation,  the  main 
contents  of  §  6  are  due  to  Prof.  A.  G.  Walker,  to  whom  I  am  greatly  indebted. 
By  a  rather  longer  and  less  elegant  method,  I  had  shown  that  the  metric 
of  any  1^  which  admits  a  field  of  parallel  null  vectors  is  reducible  to  the  form 

ds2  =  edx*  +  ft(x,y)dy2  +  2dydz     (e  =  ±1).  (7-1) 


For  this,  <7  =  -e,     ^1212  =  --  (7-2) 

It  is  easy  to  see  that  (6-9)  may  be  reduced  by  a  transformation  of  coordinates 
to  this  even  simpler  form. 

*  This  is  a  special  case  of  a  general  theorem  for  a  Vn  due  to  A.  G.  Walker  (7). 
[Note  added  in  proof.    Cf.  Eisenhart,  Annals  of  Math.  39  (1938),  316-321.] 
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It  may  here  be  noted  that,  if  En_3  is  a  flat  space  of  n  —  3  dimensions,  and 
K%  is  the  space  defined  by  (6-9)  or  (7-1),  then  the  product-space  K^.En_9) 
which  is  a  flat  extension  of  .K"3,  is  a  Kn. 

8.    The  vector  KP  appearing  in  (6-10)  may  be  expressed  in  invariant  form 

as  follows.  By  (4-5),  D 

J  V      '  Rkimn  =  tKrkirmn, 

whence,  by  (4-6),  e*rV  =  leikleimnRklmn. 

Multiplying  this  by  ^—^  ^—  .  ,  using  (6-4),  we  cet 


where  tkl  =  eikl~.. 

dx'1 

THUS  ^  =  g^log|«ro"^Mn|- 

This  result  is  also  due  to  A.  G.  Walker,  who  obtained  it  by  a  slightly  different 
method. 

9.  The  arguments  in  §§2,3,4  by  which  any  K3  was  shown  to  admit  a 
parallel  vector-field,  and  so  to  be  reducible  to  one  of  the  forms  (5-1),  (6-9) 
(or  (7-  1  )),  clearly  break  down  if  the  space  is  symmetric  (KP  ==  0).  Nevertheless, 
since  all  spaces  of  these  forms  satisfy  the  recurrence-condition,  a  particular 
non-flat  symmetric  space  is  obtainable  from  (7-1)  by  making  jcp  =  0.  This, 
by  (6-10)  and  (7-2),  requires  that  *js(x,y)  should  be  of  the  form 

ty(x,  y)  —  ax*  +  x<j)(y]  +  x(y)     (a  constant), 

making  R^\^  —  ~~a-  In  ^ne  particular  case  when  <j>(y)  and  %(«/)  are  constants, 
there  is  no  loss  of  generality  in  taking  <j>  =  0  =  x  and  a  —  e'  (=  ±  1), 
since  this  selection  of  values  merely  implies  a  linear  transformation  of  the 
variables  x,  «/,  2,  and  we  therefore  obtain 

ds2  =  edx*  +  e'x2dy2  +  2dijdz  (9-1) 

as  a  special  (symmetric)  case  of  (7-1).  It  is  of  interest  to  note  that,  with 
e  =  I  =  ef,  this  is  the  symmetric  V%  given  as  an  example  by  Cartan  ((1),  217). 

10.  In  one  of  the  earlier  papers  already  quoted  (6),  I  showed  that  the 
metric  of  every  simply  harmonic  K4  is  reducible  to  the  form 

ds*  =  adx*  +  2ydxdy  +  pdy*  +  2dxdz  +  Zdydt,  (10-1) 


where  a,  ft  are  functions  of  x,  y  only,  and,  conversely,  that  every  V4  of  this 
form  is  a  simply  harmonic  K±.  The  method  of  deriving  this  result  rather 
obscured  one  point  to  which  my  attention  has  been  called  by  Prof.  Walker, 
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to  whom  I  am  again  indebted.  The  point  in  question  is  that  a  necessary  and 
sufficient  condition  for  a  V^  to  be  a  simply  harmonic  K4  is  that  it  should  admit 
two  parallel  vector  -fields  ,  the  vectors  at  any  point  being  null  and  perpendicular. 
The  necessity  of  this  condition  was  clearly  brought  out.  The  sufficiency, 
which  was  less  clearly  demonstrated,  depends  on  the  fact  that  the  possession 
by  a  V^  of  two  such  parallel  vector-fields  is  alone  sufficient  to  ensure  its 
reducibility  to  the  form  (  10-  1  ).  In  Walker's  terminology,  the  V±,  or  rather  K^ 
is  a  null  extension  of  the  Vz  denned  by 


No  reference  has  been  made  in  the  earlier  sections  of  this  paper  to  the 
possibility  that  the  K%  might  be  simply  harmonic.  This  is  because  any 
simply  harmonic  Vn  is  an  Einstein  space  of  zero  scalar  curvature  (i.e.  such 
that  RIJ  —  0),  so  that,  inasmuch  as  any  Einstein  V2  is  of  constant  curvature, 
a  simply  harmonic  V3  is  everywhere  of  zero  curvature  and  therefore  flat. 
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ON  THE  TOTAL  RELATIVE  STRENGTH  OF  THE 
HOLDER  AND  CESARO  METHODS 

By  S.  K.  BASTJ 

[Received  and  read  20  February  1947] 

1 .    Introduction 

The  (C,  r)  and  (H,  r)  methods  of  defining  a  limit  of  a  sequence,  where  r  is 
a  positive  integer,  have  been  shown  by  various  authors*  to  be  equivalent! 
in  the  case  where  the  limit  is  finite.  Hausdorff  J  extended  the  definition  of 
the  Holder  means  to  negative  and  non-integral  orders  and  established  the 
equivalence  of  the  (C,  a)  and  (H,  a)  methods  for  a  >  —  1.  It  is  known  that,§ 
for  r  =  2,  3,  4, ...,  the  (Hyr)  method  totally  includes  \\  the  (C,r)  method,  but 
the  (C,r)  method  includes  the  (H,r)  method  only  non-totally.  Recently 
Dr  Bosanquet^J  has  shown  that  the  (H,r)  method  (r  =  2,  3,  4,  ...)  is  not 
totally  included  in  any  of  the  (C,  s)  methods  (s  >  r).  For  r  —  1 ,  the  two  methods 
are  identical  arid  as  such  each  totally  includes  the  other.  The  question  then 
arises  as  to  what  one  can  say  of  this  problem  of  total  relative  strength  for 
1  <  r  <  2,  or  for  —  1  <  r  <  1.**  The  question  as  to  whether  the  (H,  r)  method 
totally  includes  the  (C,  r)  method  for  non-integral  r  >  2,  also  seems  to  remain 
open.  The  present  paper  is  an  attempt  to  answer  these  questions.  We  have 
followed  here  Hausdorff 's  definitions  of  the  Holder  and  Cesaro  means. 

The  author  takes  this  opportunity  of  thanking  Dr  Bosanquet  for  the 
indication  of  .the  problem  and  for  valuable  suggestions  and  criticism 
throughout. 


*  For  references  see  (4),  19. 

f  I.e.  whenever  a  sequence  is  summable  by  either  of  the  methods,  it  is  also 
summable  by  the  other  and  to  the  same  limit. 

t   (1).  §    (5),  453,  or  (7),  245. 

||  A  method  of  summation  A  is  said  to  include  another  method  of  summa- 
tion B  totally,  if  every  sequence  summable  by  13  to  I  is  also  summable  by  A  to  / 
(  —  00^^+00).  If,  however,  this  only  holds  for  —  oo</<+oo,  then  A  includes 
B  non-totally.  In  case  A  and  B  totally  include  each  other,  they  are  said  to  be  totally 
equivalent  (see  (7),  loc.  cit.). 

If  (9),  12.  **  The  trivial  case  for  r  =  0  being  excluded. 
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2.   Hausdorff  transformation  and  some  of  its  familiar  properties 
The  Hausdorff  mean  yn  of  a  real  sequence  (xn)  is  defined  by* 

(w  =  0,l,2,...),  (1) 

n\ 


where  A>w  =  /*m,     A>m  =  S  (  -  1)*  L)/W»     (n  >  *)• 

p=0  \/V 

We  shall  be  mainly  concerned  with  cases  where  the  factor  sequence  (/in) 
is  a  moment  sequence,  given  by 

pn  =  Pu»dxW     (w  =  0,l,2,...),  (2) 

Jo 

X(u),  the  mass  function,  being  a  function  of  bounded  variation  in  0  <  u  <  1, 
and  the  integral  taken  in  the  Riemann-Stieltjes  sense"\.  The  corresponding 
moment  function  u(t)  is  then  defined  by 


=  [ 
Jo 


(3) 
o 

so  that  jii(ri)  =  fin     (n  =  0,  1,  2,  ...). 

Putting  A/t>  m  =  ^  A»->m     (0  ^  m  ^  n), 

n 

(1)  reduces  to  the  form          yn  =  S  An>m:rm.  (4) 

m=0 

This  is  a  linear  transformation  by  means  of  the  triangular  matrix 

AHKJI  (m,  71=0,1,2,...). 

A  Hausdorff  matrix  A  is  always  of  the  form 

A  -  ppp-1, 
where  p  is  the  triangular  matrix  with  the  elements 


and  /*  is  a  diagonal  matrix  given  by 

/<  =  II /*„,„,  II, 

;° 
where 


*   (1),  79.    If  we  are  concerned  with  the  series  S  av,  we  shall  take  xn  —  S  oy. 

v=0  v==0 

t  The  function  u°  is  denned  for  w  =  0  so  as  to  be  continuous. 
$  Evidently  pp  —  E,  the  identity  matrix,  so  that  p  =  p~l,  i.e.  p  is  its  own  reciprocal. 
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We  shall  here  restrict  ourselves  to  cases  in  which 
/tn*0     (»  =  0,1,2,...), 

so  that  the  reciprocal  matrix  /i~l  always  exists.  *  All  Hausdorff  matrices 
are  permutable  with  one  another.  If  A,  A'  are  the  matrices  corresponding 
to  fi,  /*',  then 

(6) 


, 

A  f.1 

In  particular,  if  /£,  //  are  reciprocal  to  each  other,  that  is,  finfi'n  =  1  (f^n  being 
the  elements  of  the  diagonal  matrix  /i'),  so  also  are  A,  X'  and  conversely. 
By  taking  different  suitable  diagonal  matrices  /i,  many  important  linear 
transformations  are  included  in  (  1  )  . 

If,  for  every  sequence  (xn),  xn-^>l  implies  yn->l  (|£|<oo),  the  trans- 
formation (4)  or  (1)  or  the  matrix  A  is  said  to  be  regular.  %  The  corresponding 
factor  sequence  (jin)  is  then  called  a  regular  sequence. 

Necessary  and  sufficient  conditions  that  A  may  be  regular  are  :  § 

(i)  Mn  =  £  |  Anim  |  <  M,     (ii)    S  AnjW  =  p0  =  1,     (iii)  Anj0->0. 

ttt=0  m=0 

When  for  every  (xn)  the  convergence  of  (xn)  implies  that  of  (yn]  (the  limits 
not  necessarily  being  equal),  A  is  called  a  c-matrix  (convergence-preserving 
matrix)  and  the  corresponding  sequence  (/in)  is  called  a  c-sequence.  For  A 
to  be  a  c-matrix  (i)  alone  is  necessary  and  sufficient.  ||  Also  the  class  of 
c-sequences  is  identical  with  the  class  of  moment  sequences.  ^ 

3.   Hausdorff  definitions  of  Cesdro  and  Holder  means 

Let  C%  and  H^  denote  respectively  the  Cesaro  and  the  Holder  means  of 
order  a  >  —  1  for  the  sequence  (sn),  and  C"  and  H*  the  corresponding  matrices. 
Then,  as  in  (1), 


and 


*   (11),  416. 

t  The  quotient  AA'"1  =  A'~JA  will  be  written  as  A/A'. 

J  The  term  "matrix  A  "or  simply  "  A  "  will  henceforth  be  used  for  "method  of  sum- 
mability  w^hich  corresponds  to  the  matrix  A". 

§   (1),  79-81.  Condition  (i)  implies  that  An  M->0,  m=  1,  2,  .... 

||    (1),  81.  '  If   (1),  84  and  98. 

**  It  may  be  easily  shown  that  the  Hausdorff  definition  of  C*  for  a>  —  1  is 
identical  with  the  definitions  of  Cesaro,  Knopp  and  Chapman,  while  that  of  H*  is 
identical  with  the  definition  of  Holder  for  a  =  0,  1,  2, ....  For  the  older  definitions 
see  (10),  66-67,  70. 

SEE.  2.     VOL.  50.     NO.  2407.  DD 
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,  1    „.      Aa      M-t-a\ 

where  "»  '  2-'       A^(    n    )' 

and  ^ 


The  following  theorems  will  be  proved  : 

THEOREM   1.   For    -l<a<0,    C'*->  +  oot    implies   H%->  +  co;    but    the 
converse  is  false. 

THEOREM  2.   //  0<a<l,  then   H^-^  +  ao   implies    C*  -^  +  00;    but    the 
converse  is  false. 


THEOREM  3.  For  a>l,  <?£->*  +  00  implies  #*->  +  oo;  but  the  converse  is 
false. 

THEOREM  4.    //—  l<a</?<0,  then  H%  ->  -f-  oo  cfoes  no£  imply  C%  ->  4-  oo. 
THEOREM  5.    JTitf/i  0  <  a  <  /?  <  1  ,  C«  ->•  +  oo  (foes  wctf  imy%  #£  ~>  +  oo. 
THEOREM  6.    .For  1  <  a  <  /?,  //£  ->  +  oo  efoe<s  no^  imply  C^  ->  -f  oo. 

4.    Totally  monotone  sequence  and  totally  regular  Hausdorff  matrix 

The  transformation  (4),  when  regular,  is  said  to  be  totally  regular  §  if, 
for  every  sequence  (xn),  xn-+  +  co  implies  yw-^  +  oo.  When  (4)  is  totally 
regular,  we  shall  call  A  a  totally  regular  matrix  and  the  corresponding 
sequence  (fin)  a  totally  regular  sequence. 

We  require  some  properties  of  a  totally  monotone  function  and  its 
relation  to  the  corresponding  totally  monotone  sequence. 

A  sequence  (/in)  is  said  to  be  totally  monotone\\  (really  totally  monotone 

decreasing),  if  A^          ^      ,  ^  _    0       , 

fo/  An/^n^0     (m,n  =  0,  1,2,  ...)• 

A  function  fi(t)  is  said  to  be  totally  monotone  for  t  >  r,  ||  if  it  is  differentiable 
any  number  of  times  for  t  >  r  and 

(-l)»/fc(»)(«)>0     (w  =  0,l,2,...)j  (6) 

it  is  said  to  be  totally  monotone  for  t^  T,  if  it  is  totally  monotone  for  t  >  T 
and  fji(r)  ^  /I(T  4-  0).  By  the  repeated  application  of  the  mean-  value  theorem 
we  have  for  t>r,  ^  =  (_  1)re/,n)(£)^  (7) 

*   (1),  82.  f   (*)»  83. 

t  It  evidently  suffices  to  consider  the  case  with  +  oo,  since  a  change  of  sign  of 
every  sn  produces  a  change  of  sign  of  every  C%. 

§  (7),  232.  ||   (3),  188-189. 

*j[  Here  also,  as  for  the  sequence  (/*„),  A/*(«)  =  ju(t)—/i(t+  1). 
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where  £  is  some  number  between  t  and  t  +  n.  It  then  follows  that,  if  fi(t)  is 
totally  monotone  for  t^  T,  so  that  fi(r)  ^  fi(r  +  0),  the  sequence 


is  also  totally  monotone.  In  particular,  for  t^  0,  a  totally  monotone  function 
fi(t)  gives  a  totally  monotone  sequence  fin  =  jLc(n). 

Again,  for  a  given  c-sequence  (fin)  there  is  a  uniquely  determined  function 
X(u)  of  bounded  variation  in  0<tt^l,  satisfying  (2).*  In  particular,  if 
(an)  is  totally  monotone,  the  corresponding  function  x(u)  is  a  uniquely 
defined,  bounded,  non-decreasing  function  of  ^  in  O^w^l.*  Therefore, 
whenever  a  moment  sequence  (/in)  is  given,  the  corresponding  function  f.i(t) 
is  uniquely  defined  by  (3).  Moreover,  when  (fin)  is  totally  monotone,  as 
/*(0)  ^  /i(  -f  0)  and  /*(£)  is  differentiate  any  number  of  times  for  t  >  0,f  with 


=  I 
Jo 


'u)ndx(u)^Q, 
it  follows  that  /*(£)  is  also  totally  monotone  for  £  ^  0,  with 

The  moment  function  /i(t)  is  said  to  be  regular  when  (fin)  is  regular  and 
totally  regular  when  (ju,n)  is  totally  regular. 
We  shall  now  prove  the  following  lemma: 

LEMMA  1 .    A  regular  matrix  A  is  totally  regular,  if  and  only  if 

either     (i)  (/in)  is  a  totally  monotone  sequence, 

or          (ii)  /t(t)  is  a  totally  monotone  function  for  t  >  0. 

The  two  conditions  are  equivalent,  in  the  sense  that  one  implies  the  other. 

Proof.    The  proof  of  the  lemma  follows  immediately  from  our  previous 
remark  and  from  the  fact  that  A,  when  regular,  is  totally  regular,  if  and 


i.e.  Aw~w/^m^0     (m,n  —  0,  1,  2,  ...; 

i.e.  (an)  is  a  totally  monotone  sequence. 

*  These  are  the  results  in  connexion  with  the  well-known  moment  problem  of 
Hausdorff  (see  (1),  98-102,  or  (2),  222-232).  Actually,  x(u)  is  uniquely  defined  at  all 
points  of  continuity  except  for  an  additive  constant.  It  is  then  made  unique  in 
0  ^  u  <  1  through  suitable  normalization. 

f  In  fact,  if  t  is  complex,  fi(t)  is  regular  for  R(t)  >  0. 

J   (7),   243.    It  may  be  noted  here  that  for  a  general  regular  transformation 

n 

yn  =  S  an fmxm,  the  condition  for  total  regularity  is  that  an  m 5*  0  for  m^K  and  all  n. 
In  the  case  of  a  Hausdorff  transformation  we  have  K  =  0. 

DD2 
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If  we  put  ji(t)  =  ef**>, 

then  the  condition  (6)  is  also  satisfied  for  t  >  0,  if 

(-1)V»>(*)^0*     (*•=  1,2,3,...).  (8) 

Before  taking  up  the  proofs  of  the  theorems  we  shall  prove  one  more  lemma 
which  is  of  fundamental  importance  for  our  purpose. 

LEMMA  2.f    Two  regular  Hausdorff  transformations  with  non-vanishing 
moment  sequences  cannot  be  totally  equivalent,  unless  they  are  identical. 
Proof.   As  in  (1),  let  the  two  transformations  be 


and  !4=  S  ^/Cam     (»  =  0,  1,2,  ...), 


where  /*m4=0,     /4*°     (w  =  0,  1,2,  ...)• 

Then  by  (5),  we  have 

(0  /.=  s 

w-0 

and  (ii)y.-s 

m=o 

Suppose  now  that  the  two  transformations  are  totally  equivalent,  so  that 
?/n->Z  implies  y'n-*l  (  —  oo^Z<  +00),  and  conversely.  Then  (i)  and  (ii)  must 
both  be  totally  regular.  Therefore,  by  lemma  1  , 


o     (m  =  0,1,2,...)- 
Hence  ±i  ^  (m  =  0,  1,  2,  ...); 

f^m+i       fim 

and  so  ^  =  ^=1, 

Am        /*o 

since  the  transformations  are  regular.  Therefore, 

/*m  =  /4     (w  =  0,1,2,...), 
so  that  the  two  transformations  are  identical. 


j"  This  important  observation  was  made  by  Dr  Bosanquet  in  one  of  his  clas 
lectures  on  Divergent  Series. 
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5.    Proof  of  theorems  1,  2  and  3 

The  proofs  of  these  theorems  are  almost  similar  and  so  we  shall  give  a 
joint  proof  for  them.  We  require  the  following  lemma. 

LEMMA  3.   Let 

Mr(T)  =  r-l{T~n-(T  +  r)-n}     (r=hO). 

Then,  ifT^l,n>Qand  -  I  <r  <s  (r,s=J=0), 

Mr(T)>Ms(T). 
Proof.   We  have 

n(l        dy 
r*  ~    Jo  (T  +  ryY+i' 

and  the  integrand  is  a  steadily  decreasing  function  of  r  >  —  1 ,  since  T  ^  1 . 
Proof  of  the  theorems.  To  prove  theorems  1,  2  and  3,  let 

<r(t)  =  logr(*  +  a+l)-logr(a+l)-logr(«+l)-alog(*+l), 
and  define  p(t)  so  that 

p(t)  =  (r(t)    (-l«x<0),       =-cr(J)    (0<a<l),       =a(0    (a>l).    (9) 

For  the  proofs  of  theorems  1  and  3  we  consider  the  matrix  regular  H^IC* 
and  for  theorem  2  the  matrix  regular  <7a///a.  Thus  in  theorems  1  and  3  we 

take  A* 


___  _      (0  „      (0  „ 

a  (*+i)«"      "     ' 

while  in  theorem  2 


Now  for  ^  >  0, 

...      r^  +  a+1)  __  ^=         _  ____  __   _  . 

U       T^  +  a+1)      F(t  -  ----  '1 

and  hence  for  n  =  1,  2,  3, 


*  The  function  fi(t)  is  seen,  by  Carlson's  theorem,  (8),  186,  to  be  the  right  function, 
since  it  is  regular  and  bounded  for  R(t)  >  0  and  equal  to  the  moment  constant  fin 
for  t  =  n,  n  =  0,  1,  .... 

f  See,  for  example,  (8),  149. 
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since  (t+l)~n=  £  {(m  +  t+  l)~n-  (w-M+2)-n},  the  term-by-term  difter- 


entiations  being  justified  by  the  uniform  convergence  of  the  differentiated 
series,  It  then  follows  from  lemma  3  that  for  t  >  0, 


>0     (-1<a<°)'          <°     (0<a<l),      >0 


_    \T 

Therefore,  if  a  >  -  1,  by  (8)  and  (9),  since  <r(  +  0)  =  <r(0),  fi(t)  is  totally  mono- 
tone for  t^Q.  In  theorems  1  and  3,  since  the  matrix  #a/(7a  is  regular,  it  is 
totally  regular  by  lemma  1;  and  so  (7a/#a  cannot  be  totally  regular  by 
lemma  2.  Hence  C^  ->  +  oo  implies  H^  ->  +  oo  ;  but  H  %  ->  -t-  oo  does  not  imply 
O£->-f-oo.  For  theorem  2  the  matrix  C^/H*  is  totally  regular  and  so  H^jC0- 
is  not  totally  regular.  The  result  now  follows. 

6.    Negative  results 

The  remaining  three  theorems  deal  with  negative  results.  In  each  case, 
therefore,  we  ha-ve  to  show  that  the  corresponding  matrix  is  not  totally 
regular;  and  this  will  be  true,  by  lemma  1,  if  and  only  if  ju(t),  the  corre- 
sponding moment  function,  is  not  totally  monotone  for  t  ^  0.  Again,  to  show 
that  fi(t)  is  not  totally  monotone,  in  each  of  the  above  three  cases,  it  will  be 
sufficient  to  consider  the  sign  of/dn\t)  for  small  values  of  t.  We  shall  suppose 
then  in  what  follows  that  0  <  t  <  1  ;  and  we  shall  see  that  for  small  t,  ju,(t) 
does  not  satisfy  the  test  (6)  for  a  totally  monotone  function. 

Proof  of  theorem  4.  Here  we  consider  the  regular  matrix  C^/Ha  for  which 


Put  oc  =  —p,  fl  =  —  q;     then     0<q<p<I. 

We  have  now 


8ay, 


.  ..  - 

n\ 


that  is,  *(,)  .  1  +  2  ...  -  (10) 
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the  term-by-term  integration  being  justified,  since  all  the  terms  are  ^  0.* 
As  0  <  t  <  1,  expanding  t/(t  +  n)  in  a  power  series  in  t,  we  have  from  (10), 


We  have  to  justify  the  inversion  in  the  order  of  summation.  Denoting  the 
double  series  in  (1  1)  by  £  Z  vn  k>  we  nave 

n    k 


Since  0  <  t  <  1 , <  —7 

n  —  t     n—  I 

1^1           •        ^  #(?+ 1)  •••  (q  +  n—  1)     1  -j     ,1  i 

and  the  series    £ ii__  - — _  evidently  converges  as  g<l. 

Therefore  2  2 ^w,fc  converges  absolutely  for  0<t<l.  The  inversion  is  thus 

n    k 

justified. 
Putting 

00 

we  have  then     /^(t)  =  1  +  2  ( —  1 
Again,  fiz(t)  =  (1  +t)~p 

00 

= 1+  S  (-] 


Now  both  the  power  series  for  /^(O  and  /*2(0  are  absolutely  convergent  in 
0<£<  1.  The  product  of  //i(£)  and  /ta(0)  therefore,  exists  as  a  power  series, 
absolutely  convergent  in  0  <  t  <  1  ;  as  such  the  term-by-term  differentiation 
of  the  product  series  is  admissible,  as  often  as  we  please,  in  0  <  t  <  1. 

Since  fi(t)  =  Pi(t)fa(t), 

the  coefficient  of  tk  in  the  power  series  for  fji(t) 


,  .it 

cr(g)J 


*  See,  for  example,  (8),  347.  t  The  coefficient  of  ck(q)  is  to  be  taken  as  1. 
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Hence,  if  fjb(t)  be  differentiated  successively,  for  small  values  of  t,  the  sign 
of  plk\t)  will  be  the  same  as  that  of  (  -  1  )k  fik(pt  q).  Therefore,  for  p(t)  to  be 
totally  monotone  for  t  >  0,  we  must  have,  by  (6), 


(*  =  0,1,2,...). 
Then  fj,(t)  cannot  be  totally  monotone  if,  for  some  k, 


that  is        y  p(p+l)...(p  +  k-r-l)  p(p+I)...(p  +  k-l) 

that  is,      £  ^TJI  or(q)  >  W[  . 

Since  0  <  p<  I  , 

p(p+l)...(p  +  k-2)    p(p+l)...(p  +  k-l) 

-  (j^Ti)!  -  >^~  -  *!  --      (£  =  2,3,4,...). 

Hence  to  see  that  fi(t)  is  not  totally  monotone,  it  is  sufficient  to  prove  that 

-2    * 


or,  indeed  S  <V(tf)  >  l  »  (12) 

r-l 

for  some  &. 

Now,  from  the  definition  of  cA.+1(g)  it  follows  that 

c*te)>cfc+1te)     (A:  =  1,2,3,...), 
and  since  the  first  term  of  the  series  for  ck(q)  is  g, 

Cite)  >  cate)  >       >  c 
Hence  (12)  certainly  holds  ifkq>l,  that  is, 


Thus,  for  any  given  q,  values  of  k  can  be  found  so  that  (12)  holds.  Therefore 
cannot  be  a  totally  regular  matrix.  This  proves  theorem  4. 


7.    Proof  of  theorem  5 

The  matrix  for  consideration  is  now  H^\Ca  and 
A* 


Then 

men 

where 
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f1 
Now  /hW  =     uf+^l-u^du 


f  V 

Jo  I 


o  no 


__ 
~~M=o 


the  inversion  is  justified  since  all  the  terms  are  positive. 
Expanding  --  -  in  powers  of  t,  we  have 


We  can  justify  the  inversion  as  before,  for  0  <  t  <  l.J  We  now  write 


fc-O 

u 
where 

Again, 


,.... 

K  ! 

The  two  power  series  for  ^(t)  and  /*a(t)  being  absolutely  convergent  in 
0  <  ^  <  1  ,  their  product  series  is  also  so  in  0  <  t  <  1  ;  and  thus  this  last  series 
can  be  differentiated  term-by-term  any  number  of  times  in  0  <  t  <  1  .  Since 


the  coefficient  of  t  k  in  the  power  series  for  fj,(t) 


say. 


*  The  term  corresponding  to  n  —  0  in  the  sum  under  the  sign  of  integration  is 
to  be  taken  as  ut+a. 
t  (8),  347. 
J  Actually,  for  0  <  t  <  I  -t-  a  ;  but  for  our  purpose  we  have  already  supposed  that 


§  The  term  in  this  series  corresponding  to  r  =  1  is  to  be  taken  as  dfc(a). 
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Then  for  fi(t)  to  be  totally  monotone,  for  small  values  of  t,  we  must  have, 
by(6)>  *M(*,fl>0     (£  =  0,1,2,...). 

Therefore  /i(t)  cannot  be  totally  monotone  if,  for  some  ky 


that  is,  (1  -/?)  '^Jfc-r+ifr)  >dk+l(«).  (13) 

(1  _f  a     \  k~r+l 
—  t^L.  I          <i     (n=  1,2,3,...; 
a-f^+1/ 


l(a)     (r  =  1,2,  3,  ...), 

or  A(^L 

»fc+i(a) 

Hence  (13)  will  be  true  if,  for  some  k, 


As  the  series  in  (14)  is  a  part  of  the  divergent  series 


(14)  certainly  holds  for  k>  K,  with  some  fixed  J£.  Therefore,  HP  JO*  cannot 
be  a  totally  regular  matrix.  Theorem  5  now  follows. 

8.    Proof  of  theorem  6 
For  the  proof  of  the  last  theorem  we  require  three  more  lemmas.  We  write 


so  that,  if  v8>n  =  l/sn     (n,s  =  1,2,3,...), 

1          m  lm\        1 

then  Ar_%?o<_irg_T_.  (15) 

LEMMA  4.  (i)  (l/sp)  is  a  totally  monotone  sequence  for  p>0,  with 
A^>0     (m,*=l,2,3,...). 

(ii)  A?~>Ar^i,    forp>m     (5  =  2,3,4,...). 

Proo/.    (i)  Let  f(x)  =  1/a^     (a;>0). 
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Then  /«(*)  =  (  -  , 

so  that,  by  (7),  A™/(#)  >  0,     for     x>  0. 

Therefore  /(«)=!/«»     (5=1,2,3,...) 

is  a  totally  monotone  sequence  with  A™(l/sp)  >  0. 

(ii)  Let  0(a,)l=JL__i_     (a!>o). 


Then               0<* 
Hence  by  (7), 

if 
i.e.  if 
Thus 

n)(  \  —  (     i  \rn\P  +*•)•••  (P  T-  m  —  i)/       p-t-m 

&m<f>(x)  >  0,     for     x  >  2, 

iC 

^9  >  m. 
Am0(s)>0,     for    ^>m     (5  =  2,3,4,...), 

i.e.                     L 

^f—>^f—-=)     for    p>m     (s  =  2,  3,  4,  ... 

Proo/.    For,  by  (15), 


A« 

S 


«    \p     fm\  f    s 

^i)  +U)fcr 


y 


r2      --rHi 


LEMMA  6.*  //a  series  is  summable  (H,ot)  to  +00,  ^ew  i<  is  also  sum- 
mable  (H,a  +  fi)  to  +00,  where  a  +  /?>a>  —  1. 

Proof.  Consider  the  matrix  H^P/H"  for  which  the  factor  sequence 
is  given  by  ^  fo  +!)-*-/?  _  l 


*  The  corresponding  result  for  the  (C,  a)  method  is  well  known  (see  (7),  245). 
The  case  for  the  (H,  a)  method,  when  a,  /?  are  positive  integers,  is  also  known 
(see  (6),  31). 
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Since  /?  >  0,  (ju,n)  is  totally  monotone  by  lemma  4  (i).  As  the  matrix 

is  regular,  it  is  therefore  totally  regular  by  lemma  1.  Whence  the  result 

follows. 

Proof  of  theorem.  To  prove  theorem  6  we  have  to  show  that  the  regular 
matrix  C^/Ha  is  not  totally  regular  for  1  <a<ft.  However,  instead  of  using 
the  matrix  <7^///a,  we  shall  first  consider  one  in  which  ft  is  replaced  by  an 
integer. 

Let  k  be  any  positive  integer   ^2,  and  l<a^2  (a=f=&).  Then  for  the 


Put  a  =  l-fiy,  0<^<l,so  that 


Then  /i(t)  =  , 

or,  splitting  up  the  expression  for  fi(t)  into  partial  fractions,  we  have 

*  «  1  1 


If  the  right-hand  side  be  expanded  in  ascending  powers  of  t,  we  find  that 


the  coefficient  of  tn  in  ±—L  is 
r  +  t 


__  __ 

i     vrn          2!      r^-1     "•  w! 

Then  multiplying  by  (-  l)r-*k\/{(r-2)\  (k-r)\}t  for  r«  2,  3,  ...,&,  and 
adding,  we  see,  from  (16),  that  the  coefiicient  of  tn  in  the  expansion  of 
/*(*)  is 


_     _ 

(r-2)\(k-r)\ 

__     1      ^(l-y)    1  __         y(l-y) 
7r«          2!      r^-1     "* 


.  ..(n-l-iym 
7i!  rj 


The  above  process  of  multiplication  and  subsequent  differentiations  are  all 
justified  by  the  absolute  convergence  of  the  power  series  involved  for 
0  <  t  <  1.  In  order  that  p(t)  may  be  totally  monotone  we  must  have  then  for 
small  values  oft, 
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Therefore  fi(t)  is  not  a  totally  monotone  function  if,  for  some  n, 

that  is,  if 


Again,  by  lemma  4  (ii)  with  m  —  A;—  2  and  5  =  2, 

Tp>771>+1,     for    p>k-2; 

hence  TP/!TP+1>1     (p>k-2). 

Also,  putting  m  =  k—  2,  5=2  in  lemma  5, 

^>/TP+i^2>     as    ^->0°- 
A  number  c,  independent  of  n,  can  therefore  be  found  such  that 

l<c<;  least  ofTp/Tp+1)     for    p  =  fc-1,  k,...,n. 
Then 


(17) 

*  /£       9\    1         *-2  /£_9\  1 

No,  S-> 


=  Aj-2-      with  5  =  2,     by     (15), 

6* 

=  7;,     say     (w=  1,2,3,...). 
Then  (17)  may  be  written  as 

^.,+l^=3QT^+...+^-^";;*-1=g>rI>gu.        (is) 

Taking  m  —  fc  —  2  and  s  =  2  in  lemma  4  (i),  we  have 
Aj~2  —  >  0,     where     ,9  =  2, 

i.e.  2;>0     (p=  1,2,3,...)- 

Hence  (18)  is  true,  if 
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So  it  follows  from  (19)  that  /i(t)  is  not  totally  monotone  if,  for  some  n, 


Now,  since  the  series  in  (20)  is  a  part  of  the  divergent  series 


as  c>  1,  (20)  certainly  holds  for  n  >  N,  for  some  N.  Hence  the  matrix  CkjH* 
is  not  totally  regular,  i.e.  JEf*  ->  +  oo  does  not  imply  O£->  +  oo. 

Next  suppose  l<a^2,  /?>a.  Suppose,  if  possible,  that  #£->-t-oo 
implies  <7£->-hoo.  Then,  k  being  an  integer  ^/?,  as  the  (  (7,  /?)  method  is 
totally  included  in  the  (C,  k)  method,  (7£->-f-oo  implies  £7*->  +  oo.  Hence 
H  %  ->  +  oo  implies  C*  ->  +  oo  which  is  contrary  to  what  has  just  been  proved. 
So  for  1  <  a  <  2  and  fl>a,  H%  ->  +  oo  does  not  imply  C&  ->  +  oo. 

Lastly,  suppose  2<a</?.  Suppose,  if  possible,  that  #*H^  +  OO  implies 
(?£->•  +  oo.  Then  since  a  >  2  and  the  (H,  y)  method  is  totally  included  in  the 
(H,y')  method  for  y'>y>  —  1,  by  lemma  6,  //*->  +  oo  implies  j^*->-f-oo. 
Hence  ^T'-^-j-oo  also  implies  (7£~>  +  oo,  which  contradicts  our  previous 
result.  Therefore,  for  1  <  a  <  /#,  //*  ->  +  oo  does  not  imply  C&  -»•  4-  oo. 

This  completes  the  proof  of  theorem  6. 
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SOME  REMARKS  ON  THE  CONSTRUCTION  OF  CONTINUOUS 
NON-DIFFERENTIABLE  FUNCTIONS 

By  F.  A.  BEHREND 
[Received  1  February  1945 — Read  1  March  1945] 

Weierstrass's  function     f(x)  —  S  avcoabv7rx,  (I) 

as  well  as  a  number  of  other  examples  of  continuous  functions  without 
derivative  at  any  point,  are  of  the  form 


f(x)  =  S  a»g(bvx)     (0<a<  1,  ab^  1).|  (2) 


f  Besides  (i)  cos  TTO,  the  following  may  be  mentioned:  (ii)  sin  TTX,  (iii)   |sin7rx|, 
and  their  polygonal  analogues 


for 

(v)  X(a?)=l-a:,         for 

for 


for 

f      . 

for     ^  <  a?  <  1  , 


(iv)  and  (vi)  are  essentially  identical,  since 

Weierstrass's  original  example,  viz.  (i)  with  odd  integral  6,  is  reproduced  by 
Titchmarsh  (10).  Improvements  of  Weierstrass's  result  were  given  by  Bromwich  (4)  ; 
his  results,  as  well  as  similar  results  for  (ii),  (iii)  and  (iv),  for  integral  6,  are  found  in 
Knopp{6)  and  reproduced  in  Hobson(7).  Cellerier(2)  treats  (ii)  for  integral  b  and 
ab  =•  1;  this  example  also  is  reproduced  by  Hobson.  The  cases  (i)  and  (ii),  with  arbi- 
trary 6,  were  studied  by  Hardy  (5);  see  also  Dini(3).  Examples  of  (vi)  are  found  in 
van  der  Waerden(9)  (a  =  ^V,  6  =  10),  Landau  (12)  (a  =  i,  b  =  4),  and  Hildebrandt(ll) 
(a  =  £,6  =  2):  van  der  Waerden's  example  is  reproduced  by  Titchmarsh  (10). 
Caratheodory(S)  treats  (iv)  for  odd  6. 
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The  usual  study  of  these  functions  can  be  simplified  by  the  following 
remark  which,  though  rather  obvious,  does  not  seem  to  have  been  stated 
explicitly  in  the  literature:  f(x)  can  be  written  as 

/(*)=/„(*)  +  %(*),  (3) 

n-1 

where  fn(x)=  S  aW^*),  (4) 

v=0 

and  rn(x)  =  anf(bnx),  (5) 

i.e.  the  curve  y  =  rn(x)  is  obtained  from  y  —  f(x)  by  reducing  the  ordinates 
in  the  ratio  1  :  an  and  the  abscissae  in  the  ratio  bn  :  I  .  If,  therefore,  f(x)  is 
at  all  crinkly  in  the  large,  rn(x)  will  be  crinkly  in  detail,  and  indeed  —  if 
ab  >  I  —  more  crinkly,  since  the  gradients  of  the  chords  of  rn(x)  will  be 
(ab)n  times  the  gradients  of  the  corresponding  chords  of  f(x).  If,  on  the 
other  hand,  g(x)  is  not  too  crinkly  itself,  e.g.  satisfies  a  Lipschitz  condition, 
the  gradients  of  chords  offn  (x)  will  increase  like  (ab)n~l  at  most,  and  we  shall 
expect  that  for  sufficiently  large  ab  the  contribution  of  rn(x)  will  outweigh 
that  offn(x)  and  that  f(x)  will  be  non-differentiable. 

It  will  be  shown  in  the  following  that  this  is  actually  the  case  for  a  quite 
general  class  of  functions  g(x)  [Theorems  I,  II].  An  advantage  of  our  method 
is  that,  though  elementary,  it  does  not  require  the  restriction  of  b  to  integral 
values  usually  adhered  to  in  the  elementary  treatment.  As  an  illustration 
we  apply  the  method  to  Weierstrass's  original  example;  we  show  how  the 
known  elementary  estimates  and  better  ones  can  be  obtained;  it  will  be 
proved,  for  example,  that  (1)  nowhere  possesses  a  derivative  finite  or  in- 
finite if  ab  ^  7-05  for  arbitrary  real  b  [see  (30)]  or,  more  generally,  if 


-,  , 

2COS7T&!  X         6—1 

[see  (31)].  Hardy's  result  (5)  that  (1)  does  not  possess  a  finite  derivative 
for  ab  ^  1  cannot  be  obtained  in  this  elementary  way,  but  it  can  be  proved 
for  integral  b  and  for  real  b^^g.;  only  the  case  where  b  is  near  to  1  seems 
really  difficult  and  deep.  Corresponding  results  for  the  functions  mentioned 
in  our  first  footnote  are  listed  in  later  footnotes  and  after  Theorem  III. 
Theorem  III  establishes  a  rather  curious  property  of  a  certain  class  of 
piecewise  linear  functions  g(x)\  the  corresponding  f(x)  is,  for  ab  ^  1,  either 
non-differentiable  or  itself  piecewise  linear. 

We  distinguish  two  cases  according  to  whether  the  non-existence  of  a 
finite  or  infinite  differential  coefficient  or  merely  that  of  a  finite  differential 
coefficient  is  required. 
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1 .   Functions  without  derivative  finite  or  infinite 

As  a  sufficient  criterion  we  use  the  following:  the  continuous  bounded 
function  f(x)  does  not  possess  a  differential  coefficient  finite  or  infinite  at 
f  if  to  every  arbitrarily  large  C  >  0  correspond  two  values  x,  x'  such  that 

C,  (6) 

C.t  (7) 


Consider  a  function  f(x)  defined  by  (2);  assuming  g(x)  to  be  defined  for 
all  x,  continuous  and  bounded,  f(x}  will  be  continuous  and  bounded.  Sup- 
pose, further,  that  g(x)  satisfies  a  Lipschitz  condition 

for  all  x  and  £.  Then 


if  ab  >  1,  for  all  x,  £.  Suppose  that,  on  the  other  hand,  a  lower  measure  K  >  0 
for  the  "  crinkliness  "  of  f(x)  can  be  found,  i.e.  that  for  every  f  values  x, 
x'  can  be  found  such  that 


A(/;  *,£)>#,     A(/;  *',£)<  -AT.  (10) 

The  function  rn(x)  =  anf(bnx)  will  then  have  this  property:  to  every  £ 
correspond  x  and  x'  such  that 

&(rn'9x,£)^anbnK,     A(rn  ;#',£)«£  ~anbnK.  (11) 

From  (9)  and  (11), 

nnhn  I  M     \ 

A(/;  a,  f)  >  a«6-jr  -  Jf  ^y  =  a-6»  \K  -  ~~  j  ,  (12) 

i.e.  A(/;  »,  g)  becomes  arbitrarily  large  positive,  and  similarly  A(/;  x',  £) 
arbitrarily  large  negative;  and  f(x)  nowhere  possesses  a  derivative,  finite  or 
infinite,  if  ^f 


i.e.  if  ab>l  +  M/K.  (14) 

It  should  be  noted  that  K  may,  and  in  general  will,  depend  on  a,  6,  and  that 
(14)  cannot  necessarily  be  satisfied  by  giving  ab  sufficiently  large  values. 
For  the  determination  of  K  a  rough  study  only  off(x)  will  be  needed.  It  is 

•f  For  automatically  £  will  be  a  limit  point  of  such  x,  x'  . 
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usually  easy  to  determine  the  value  off(x)  for  special  values  of  the  argument; 
suppose  now  that,  for  every  f,  two  such  special  arguments  xlt  xz  exist 
such  that  x1  <  £  ^  x2,  x±  <  x2,  and 

Atfta^a^JT;  (15) 

then  A(/;  *;,£)>#, 

where  x  =  xl  or  rr2.t  Similarly,  if  x{,  x2  can  be  found  with  x[  ^  £  ^  #2,  #i  <  #2> 

and  A(/;a;,a?i)<-^.  (16) 

then 


for  #'  =  x[  or  #2-  Geometrically  this  means  that,  to  every  £,  two  chords 
"overlapping"  £  of  gradients  ^K  and  <  —  K  have  to  be  found.  J 

To  illustrate  the  procedure  consider  Weierstrass's  original  example  (1). 
Here  g(x)  —  cos  773;  and  M  —  n.  For  the  determination  of  K  observe  that 
(15)  only  has  to  be  satisfied;  (16)  follows  from  f(x)  =  f(  —  x). 

First  let  b  be  an  integer.  Since  f(x)  has  the  period  2,  we  can  restrict  £  to 
the  interval  0  <  £  ^  2.  Choose  xl  =  —  1,  x2  =  2,  then 


where  otj  =  1/(1—  a);  i.e.  (14)  reads 

a6>l  +  f7r(l-a),     for  odd  6,  (17) 

a6>l4-|7T,  for  even  6.  (18) 

If,  in  the  case  of  even  6,  we  choose  xl  —  —  f  ,  f(xj)  —  —  ^alt  we  get 
A(/;^,o;2)  =TVi  =  K 

and  a&>H-if7r(l-a).  (19) 

(17)  and  (18)  are  well-known  estimates  (Bromwich(4),  Knopp(6)),  while 
(19)  appears  to  be  new;  it  is  better  than  (18)  for  a  >  ^2,  but  asymptotically, 
i.e.  for  large  6,  the  lower  bound  for  a  given  by  (18)  is  better  than  that  given 
by  (19).  Slight  asymptotical  improvements  on  (17)}  (18)  can  be  obtained, 
but  they  go  only  just  to  show  that  the  constant  §77  is  not  the  best  possible 
estimate.  Thus  if  ju,  is  the  solution  of 

COtTT/4  =  (f  —  fi)n 

between  0  and  J  (approximately  0*0697),  and  Tc  is  even  and  |  bji  —  k  \  <  1, 

then,  taking  ,  ,,  rt     ,  ,, 

6  #!  =  -  1  +  k/b,     xz  =  2  -  kjb, 

t  For  A(/;  arlf  a;2)  is  a  mean  value  between  A(/;  xlf  £)  and  A(/;  ccg,  ^);  see,  for 
example,  Bromwich(4)  or  Knopp(6).  $  Knopp(6). 
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we  obtain  a  result  of  the  form  (17)  or  (18),  the  constant  f/r  being  replaced  by 


I  do  not  know  whether  this  is  the  best  possible  asymptotic  estimate.* 


*  Although  it  seems  likely  to  me  that  the  above  result  is  the  best  possible  asymp- 
totic estimate,  I  have  not  been  able  to  prove  anything  beyond  Hardy's  theorem  that, 
for  b  —  4m  +  1,  %  —  \,  f(x)  possesses  a  definite  infinite  differential  coefficient  if  ab^  I 
and  a(6+  1)<  2  (except  that  the  <  sign  can  be  replaced  by  <):  see  Hardy (5),  313. 

Some  estimates  about  the  other  functions  mentioned  in  the  first  footnote  are  listed 
in  the  following  table: 


d(x) 

M 

6 

i 

*, 

/(,) 

*» 

A<,LT,> 

Condition  (14) 
ab> 

Known 

estimates 
ab> 

2m  J 

-i 

1 

5 
•2 

-1 

1 

1 

* 

1  |        5n 

\    l  +  ;]77      (6,7) 

sin  TTX 

7T 

I 

1 

r> 

1 

1 

2 

J 

1  +  ^7T          (6    7) 

4m  +  3 

i 

5 

1-a 
1 

1  -a 
1 

3(1  -a) 
2 

I  +  ftjr(i  -j-a) 

" 

U 

1+0 

1  +  a 

3(1  +  «) 

2m  | 

-* 

5 

'* 

/ 

l  +  2a 

4 

\    4         (6,7)f 

x(.) 

1 

1 
4m  +  1 

5 

! 

i 

5 

1 

1  +  2o 

1     i    O/1         _\ 

J 

4//i   7\ 

4m  +  3 

1 

•2 

2(l-«) 

2(1  -a) 

1 

1 

(Oj  '/ 

2 

2(l  +  o) 

2(l+a) 

3(l+o) 

1  +  3(1  +  0) 

2m  J 

0 

o 

i 

0 

0 

1 
V3 

•i 

3V3 

1  +         f  1     o 

}  »„  ,« 

|  sin  nx  \ 

7T 

I 

2m+l 

0 

1 

0 

1 

2 

J 

l  +  |w    (6,7) 

2m  | 

0 

0 

f 

4 

0 

Q 

1 

J 
1 

4 

|     4          (6,7) 

*»>t 

1 

I 

2m  +1 

Q 

/i 

Q 

3(1  -a) 

1 

4(l-a) 

1 

J 
4          /A  7\f 

" 

2(1  -a) 

3d-«) 

f  The  condition  ab>  I  given  by  Hobson(7)  is  erroneous;  as  in  Hardy  ((5),  313)  (see  the  above 
remark),  it  can  be  shown  that  f(x)  has  a  definite  infinite  derivative  at  certain  special  points  when 
06  >  1  and  a(b  +  2)  <  3,  at  x  =  0  if  g(x)  is  #(#),  at  x  -  J  if  g(x)  is  i/r(x). 

J  By  the  remark  in  the  first  footnote  the  same  result  holds  for  <j>(x)  as  for  ft(x). 

EE2 
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Now  let  b  be  arbitrary,  but  >  3.  Choose  e  so  that 


and  put  d=  cos  776.  (21) 

Then,  cos  nx  ^  8  >  0  in  any  interval 

2m-e^a?<2m-fe     (m  integral).  (22) 

We  shall  call  such  an  interval  a  £-peak  of  cos  nx.  In  every  interval  of  length 
^  2  -f  2e  lies  at  least  one  complete  £-peak.  It  follows  that  every  interval  of 
length  $?  (2  +  2e)/bn  contains  at  least  one  complete  (aw#)-peak  of  an  cos  bnnx. 
Now  the  length  of  an  (an~1<5)-peak  of  a71"1  cosbn-17rx  is 


2e        2e       2e 

(23) 


consequently  each  (aw™15)-peak  of  a71-1  cos  bn~-17rx  contains  a  complete 
(an£)-peak  of  ancosbnnx.  Starting  with  any  peak  (22)  of  COSTTO:  we  can 
construct  successively,  for  n  =  1,2,3,  ...,  a  chain  of  peaks  of  ancosbn7rx, 
each  of  which  is  contained  in  the  previous  one;  all  these  intervals  will  have 
a  point,  #2m  say>  ^n  common  for  which 

a'-Y^r,     \2m-x2m\^e.  (24) 

— 


Similarly  for  every  2m  -f  1  ,  x2m+1  can  be  found  such  that 

-aWnl'Se.  (25) 


Since    xk^k  +  e<k  +  ^<k+l—  e^  xk+1,    every    g    lies    in    an    interval 
fem+nawJ;  hence 

o  »  e\  & 


and  we  have  this  result:  Weierstrass's  function  (I)  does  not  possess  a  differ- 
ential coefficient,  finite  or  infinite,  if  for  some  e 


(27) 
^1+e-1,  (28) 

O     i     Op 

l-a).  (29) 

x 


These  conditions  can  be  satisfied,  for  example,  if 

5.  (30) 
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For,  with  e  =  £,  $  =  cos  ^n  =  J  -^3,  we  have 

b>ab^  7-05  >  7  =  1  +  e-1, 

Oil  O     I     Of 

and  a6^7-05>l4--^7r>l+~^ (1-a). 

^/3  2  cos  Tre v 

Jf  ore  generally,  if  e  is  chosen  as  61}  £Ae  condition  obtained  is 


(31)  is  less  restricted  than  Dini's 

,      ,      ,     1 —a 
ab>  1  -f  |  n 

(loc.  cit.  (3)),  and  stronger  in  every  case  where  Dini's  estimate  is  applicable. 
It  is  not  contained  in  Hardy's  results  (loc.  cit.  (5)),  since  they  are  concerned 
with  the  non-existence  of  a  finite  derivative  only. 

It  is  obvious  that  the  same  procedure  can  be  applied  if  cos  rrx  is  replaced 
by  any  of  the  functions  mentioned  in  the  first  footnote,  or,  indeed,  by 
any  continuous  periodic  function  g(x)  satisfying  a  Lipschitz  condition  (8).f 
In  fact,  the  periodicity  of  g(x)  is  not  required,  and  may  be  replaced  by  the 
following  property  of  "pseudo  -periodicity":  two  constants  A,  B,  A>  B 
exist  such  that  in  every  interval  of  length  p  two  points  a,  ft  can  be  found 
for  which  g(oi)  ^A,  g(/3)  <  B.  Then  peaks  round  all  a's  and  valleys  round  all 
/?'s  can  be  constructed  as  follows.  Choose  p  such  that 

0<p<%(A-B),  (32) 

and  e  =  pjM.  (33) 

Then 

{gW-gWl^Me  =  p,     g(x)^g(a)-p^A—  p  =  6"     (\x  —  a|<e), 


and  since  d  —  S'  =  A  —  B  —  2p>0,  by  (32),  peaks  and  valleys  do  not  overlap. 


f  The  following  results  may  be  mentioned  : 

(ii)  g(x)  =  sin  nx;  evidently  the  same  estimates  will  hold  as  for  g(x)  —  cos  TTX. 

(iii)  g(x)  =  |  sin  TTX  \  ;  here  the  "  valleys  "  round  x  =  m  will  show  a  different  behaviour 
from  the  "peaks"  round  x  =  (w  +  i);  modifying  the  procedure  accordingly,  (31)  will 
be  replaced  by 


cos  ^nbl  —  sin  ^TTOI 

which  is  satisfied  for  a6^7-91. 

(iv),  (v),  (vi)  g(x)  —  <fi(x),  x(x),  or  ft(x).  The  procedure  will  be  the  same  in  all  three 
cases  and  exactly  analogous  to  that  for  cos  TTX  ;  the  estimate  obtained  is 


which  is  satisfied  for  a6  >  6-6. 
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It  follows  that  every  interval  of  length  ^p  +  2e  contains  a  complete  peak 
and  a  complete  valley  of  g(x).  Choosing 

b>(p  +  2e)/2e,  (34) 

and  constructing  a  sequence  of  nested  peak  (or  valley)  intervals  of  ang(bnx) 
as  above  for  an  cos  bn7rx,  we  find  for  every  a,  ft  points  xa,  x^  for  which 


(36) 

and  in  every  interval  of  length  ^  p  H-  2e  at  least  one  pair  a?a,  x^  can  be  found. 
Choosing  now,  for  any  £,  the  largest  Xp^£,  and  the  smallest  xa  ^  £,  we  have 

(37) 


and 

Mf.~    ...)-/(*.)-/(«/)>  S~S'          -      A-B-2p      _ 

*V,*.,Xf)-      Xa,x-^2(p  +  2e)(l-a)~Z(p  +  2e)(l-a)          ' 

(38) 

and,  similarly,  for  appropriate  a;a/,  x^, 

^(flx^XpX-K.  (39) 

Condition  (14)  then  reads 


In  order  to  satisfy  (34)  choose 

.  (41) 


Then  (40)  becomes       a&>  1+    .      p     J  ,   (1  -a).  (42) 

^L  —  J3  — 

Here  6  is  restricted  by  (32), 


Both  (42)  and  (43)  are  certainly  satisfied  if 

2Mpbb, 


/AA^ 
l  (44) 

i.e.  for  sufficiently  large  ab.  Thus  we  have 

THEOREM  I.    Ifg(x)  is  defined  for  all  x,  continuous,  bounded,  and  "pseudo- 
periodical",  and  satisfies  a  Lipschitz  condition,  then 


does  not  possess  a  derivative,  finite  or  infinite,  at  any  point  if  ab  is  chosen 
sufficiently  Jar  ge,  e.g.  according  to  (44). 
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2.    Functions  without  finite  derivative 

As  a  criterion  we  use  the  following :  f(x)  does  not  possess  a  finite  differential 
coefficient  at  £  if  a  constant  C  >  0  exists  such  that  in  any  arbitrarily  small 
neighbourhood  of  £  two  values  x,  x'  can  be  found  for  which 


;'— £ 


(45) 


Again  let 


Suppose  that  g(x)  satisfies,  instead  of  (8),  a  "Lipschitz  condition  of  the 
second  order",  viz. 

;x,*)\*N(\x-£\  +  \x'-t\)  '  (46) 


for  all  £,  x,  x'.  This  will  be  the  case,  for  example,  if  g"(x)  exists  and  is  bounded. 
For  if  |  g"(x)  \  ^  ZN,  then 


x-l 


g'(g)  (»'  -  g)  +  to'(»'  )  («'  - 


From  (46) 


bvx-~bvg 


(47) 

provided  that  ab2  >  1 .  Suppose  that,  on  the  other  hand,  a  rough  lower  bound 
of  A(/;  £;  x,  x')  can  be  found  in  the  following  form:  for  every  £  values  x,  x' 
with  \x  —  £  |  +  |  #'  —  £\^E  exist  such  that 

(48) 
(49) 


and,  moreover,         A(/;  g;  sc, «')  >  I/(  |  x  —  £  \  +  |  x'  —  £  \ ), 


where  E,  D,  L>0  are  independent  of  £.  The  function  rn(x)  =  anf(bnx)  will 
then  have  this  property:  for  every  £  values  x,  x1  exist  such  that 

(50) 

!*'-£!)•  (si) 


f  See  Bromwich  (4),  where  this  result  is  used  in  the  case  of  g(x)  —  cos  nx. 
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As 


E 


xt  x'  will  lie  in  an  arbitrarily  small  neighbourhood  of  £  if  n  is  sufficiently 
large.  Also 


n;  g;  x,x')-N 


anb2n 


for  all  w,  i.e.  /  nowhere  possesses  a  finite  derivative  if 


(52) 

(53) 

The  constants  L,  D  can  usually  be  determined  in  the  following  way: 
consider  any  £;  choose  suitable  points  xlt  #2>  x[,  x%  for  which  the  values  of 
/  are  known,  such  that 

(54) 


and  J^L-l™  _Jj^L_^pj.  =  £>*>()',  (55) 

then  A(/;g;oj,  &')>/)*,  (56) 

where  x  is  one  of  x^  x2,  x'  one  of  x{,  x'z.  This  follows  as  under  (15)  and  (16). 
The  value  of  |  x  —  £,  \  4-  |  xf  —  £  |  depends  on  the  values  of  x,  x'  which  again 
depend  on  the  position  of  the  point  P  —  (£,/(£)).  If,  as  will  generally  be 
the  case,  the  chords  used  in  (55)  intersect  at  a  point  8  —  (&,T),  four  cases 
can  be  distinguished  according  to  the  position  of  P  in  one  of  the  four  regions 
shown  in  Fig.  1. 

Choose  x  and  xf  as  shown  in  the  following  table. 


Pin 

x 

x' 

\x-t\  +  \x'-£\ 

I 

xz 

x{ 

x2~x{ 

II 

xz 

x'z 

x2+x'2  —  2g^xz  +  x'2  —  2or 

III 

xl 

x'z 

x'z~xl 

IV 

*l 

*l 

2£-(xl  +  xi)^2<r-(xl  +  x'1) 

If  the  maximum  value  in  the  last  column  is  E*t  we  have,  in  all  cases, 

Irf  £"   I      I      I    /y'   £   I    <••     '. 

^        b|>|*(/          fe|^:J 

A(/;  £;  x,  x')         D* 

^JJQJ  -  -_       .  ~~    ^   "W-fc 


(57) 
(58) 


1945] 


CONTINUOUS  NON-DIFFERENTIABLE  FUNCTIONS 


473 


Generally,  the  same  chords  cannot  be  used  for  every  £,  and  one  has  to 
choose  L  as  the  minimum  of  all  L*,  D  as  the  minimum  of  all  Z>*. 

As  an  illustration,  we  give  a  proof,  by  our  elementary  method,  of  Hardy's 
result:  if  0<a<  1  and  b  is  an  integer  >  1,  then  Weierstrass's  function  (I) 
nowhere  possesses  a  finite  derivative  if  ab^\."\ 


We  have 


d2 cos  nx 


dx* 


i.e. 


(59) 


Fig.  1. 


As  before,  it  is  sufficient  to  consider  the  interval  0  ^  £  ^  1.  Choose  x1  —  —  1, 
x2  =  2,  x[  =  0,  x'z  =  1.  Then, 


whence 


D  =  Z>*  =  f  f  ^  =  f  '' 

If     +2     =|, 


''     if6is(°dd' 
I  even. 


The  two  chords  intersect  for  a  =  J;  thus  E*  —  2  and 

•>*       t*av      ...  .    (odd, 
if  6  is  \ 


Hence  (52)  and  (53)  become 

«62>  1-f  |7T2(l-a)     (a&^l),     if6isodd,t 
.  i  +  |7r2)  (aft  ->  i)?     if  ft  is  even.t 


(60) 
(61) 


t  Loc,  cit.  ((8),  304). 

|  Bromwich  (4)  only  obtains  |7r2  instead  of  f  zr2  because  he  uses  |  x2  —  x:  \  +  |  x£  -  x{  \ 
instead  of  \x-g\  +  \x'-g\. 
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But  for  b  >  5,  ab  ^  1  implies  the  other  condition,  since 

ab2  =  ab.b  ^  5  >  I  +  f .  10 >  1  +  |7r2  >  1  +  §7r2(l  -a). 

Thus  only  the  cases  6  =  2,  3,  4  remain  to  be  considered. 

6  =  3.  We  have  a  ^  £,  A  =  04  ^  f .  Since /(^  +  #)  =  —  /(£  —  #)  we  have  only 
to  study  the  interval  0^g<£;  in  order  to  obtain  a  sufficiently  sharp 
estimate  we  have  to  divide  the  interval  as  shown  in  the  following  table. 


-i 

i 


-A 
I-A 


-A 
0 


D* 


.27* 


=  ^,  and  condition  (52)  reads 


Hence  JD  = 


and  is  satisfied  because  ab2  —  ab. 

6  =  4.   Here  a  ^  J,  A  =  «j  ^  f,  and  the  interval  0  <  g  <  1  has  to  be  con- 
sidered; we  get 


, 


1-iA 
1-iA 
A-l 


1-1A 
A-l 
A-l 
A-2 


A 

1-iA 
A-l 
A-l 


A-2 


£7* 


(f  A-  2)  6+1.3^3 
0+  ($A-1)  6^6 
(|A-2)  3+1.2^2 


,  and  the  condition 


Hence  D  =  2>0,  JO 


is  satisfied  because  aft2  =  ab  .  b  >  4. 

6  =  2.  Here  a  ^  J,  A  =  ax  >  2;  we  shall  also  use  the  inequalities 
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*<£«* 


-i 


A-l-2a 


1-iA 

A-l-2a 

A-2 

/(£) 
A-2 


A-l-2a 

M) 

A-l-2a 


1-iA 


z>* 


*<£<* 


0+(fA-l)3>6 
(|A-2(l-fa))  6+1.3^3 


+  - 


1.6  =  4 


1- 


£—  +  — -: 


6 
6 
9 

6 
¥ 


}  and  the  condition 


Hence  D  = 


is  satisfied  because  «62  =  «6  .6  ^  2.  This  proves  the  statement. 

Now  let  b  be  arbitrary,  0<e<  J,  6>  1  +  e"1.  We  use  the  special  points 
xk  constructed  above;  let  Pk  =  (xk,f(xk)),  P  =  (g,/(5));  then 


-  *fc  |  ^  e,    f(x2 


for  all  integral  k,  m.  Suppose  that  g  is  in  the  interval 
case  xzm+1  <  g  <  ^2m+2  ig  treated  similarly).  Let 


(62) 
(63) 


m-i  =  (2m  -  1  -  6,  -  atej,     62m  =  (2m  -  e,  AiJ, 

Ga»+i  =  (2m  +  1  +  e,  -  A^),     Q2m+2  =  (2m  +  2  +  e,  to^). 


meet  at  ^  =  (2m  "*"  i»  °)»  ^  ^es  m  one  °^  tne  regions 
I,  II,  III  and  IV  shown  in  Fig.  2.  If  P  lies  in  I,  we  have 


;  £;  *2m+2>  *2m)  >  gradient  ^+2  R  -  gradient  QZm  R 
B  S  St(l+e) 


and 
thus 


(3  +  2e)(l  +  2e)  (I -a)' 


(64) 
(65) 
(66) 
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If  P  lies  in  II,  we  have 

A(/;  &  *W 2>  *Wi)  >  gradient  £2m+2  #  -  gradient  g2m+1 7?  =  D     (67) 

i  =  JET,  (68) 


and  the  same  values  are  obtained  for  III  and  IV.  Thus  the  value  (66)  of  L 
holds  for  all  £,  and  the  conditions  for  the  non-existence  of  a  finite  differential 
coefficient  become 


SCOSTTC 


-a),     ab  >  1,  0<e<  i, 


(69) 


Satisfying  the  last  condition  by  putting  6=1  4-e"1,  i.e.  e  =  6l5  we  find  that 
Weierstrass's  function  ( 1 )  nowhere  possesses  a  finite  differential  coefficient  if 


8  COS  7T01 


1 


(70) 


PH.M 


Fig.  2. 


This  is  stronger  and  less  restricted  than  Dini's  (loc.  cit.  (3)) 


but  considerably  weaker  than  Hardy's  ab  ^  1  (^oc.  ci^.(5)).   Hardy's  result 
can  only  be  established  in  this  way  for  b  ^  60  (e.g.  6  >  ^).  The  conditions  • 


(71) 
are  certainly  satisfied  if  b  >  K,     ab  ^  1 ;  (72) 

for  the  first  inequality  in  (71)  is  equivalent  to  a>(\  +/c)/(62  +  Ar),  and, 
since  a^  1/6,  this  is  satisfied  if 

1      1-h K     .         (6—1)  (6  — AC)  . 

>  0     i.e.     6  >  K. 
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Hence  (70)  will  be  satisfied  if 

p+yjd  +  afc,)  } 

8  cos  ^ 
which  imply  6^3;  and  it  is  easily  verified  that  (73)  holds  if 

6>3£,     ab^l.  (74) 

The  same  procedure  obviously  applies  to  more  general  functions  g(x) 
and  the  result  will  be : 

THEOREM  II.  //  g(x)  is  defined  for  all  x,  continuous,  bounded  and  "pseudo- 
periodic",  and  satisfies  a  "  Lipschitz  condition  of  the  second  order  ""j*  then 

f(x)=  SaW*)     (0<a<l), 

v  =  0 

nowhere  possesses  a  finite  derivative  if  ab  ^  1  and  b  is  sufficiently  large. { 

It  should  be  noted  that  this  theorem  does  not  apply  to  functions  g(x) 
which  are  piecewise  linear  (polygonal),  since  such  functions  do  not  satisfy 
a  Lipschitz  condition  of  the  second  order.  However,  we  can  prove 

THEOREM  III.  //  g(x)  is  piecewise  linear  (polygonal)  but  not  constant,  and 
periodic  with  period  1 ,  with  a  finite  number  of  vertices  in  the  interval  0  <  x  ^  1 
all  of  which  have  rational  abscissae,  then 

f(x)  =  S  <*vy(bvx) 

y=0 

is  a  continuous  function  without  finite  derivative  at  any  point  for  all  a,  b 
satisfying  the  conditions 

0  <  a  <  1 ,     b  an  integer,     ab  >  1 , 

with  the  possible  exception  of  a  finite  number  of  cases  in  which  f(x)  is  itself 
a  polygon. 

Proof.  (1)  LEMMA.  ///(£)  is  continuous  for  a <£</#,  and  not  linear  in 
this  interval,  then  there  is  a  constant  D>Q  such  that 

A(f;&x,x')>D  (75) 

for  all  £  and  suitably  chosen  x,  x'  in  the  interval. 

•f  Which  implies  a  Lipschitz  condition  of  the  first  order  needed  for  the  construction 
of  special  points. 

J  It  may  be  added  that  the  non-existence  of  a  finite  right-  and  left-hand  derivative 
for  large  b  and  ab  ^  1  can  be  established  in  the  same  manner ;  one  has  only  to  choose 
Xj  x'  on  the  same  side  of  £  which  will  result  in  a  smaller  value  of  L,  i.e.  in  a  less 
favourable  estimate  for  6. 
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For  choose  y  so  that  the  points  (a,  /(a)),  (/?,/(/?)),  (y,/(y))  are  not  col- 
linear;  then 

A(/;  £;  x, x'} ^ -^ —   =  Dl>0 

a  — y  a—p 

A(/;£;*,3)2*    -^~ a-/?  "    =I)2>0 

where  #,  a/  are  suitably  chosen  from  a,  /?,  y;  then  Z>  =  min(Z>1,D2)  will 
satisfy  the  requirement. 

(2)  f(x)  is  continuous,  with  period  1;  write,  as  above, 

f(x)=fn(x)  +  rn(x),  (76) 

where  fn(x)  =  ^  avg(bvx)  (77) 

v=0 

is  a  polygon,  and  rn(x)  —  anf(bnx).  (78) 

(3)  Suppose  that  f(x)  is  linear  in  some  interval  a  ^  x  ^  /?,  /(a?)  =  ?(a:),  say. 
Choose  w.  so  that  bn(ft-ot)  ^  1;  then 

aV(6»a?)  =  rn(a;)  =f(x)-fri(x)  =  l(x)-fn(x) 

will  be  a  polygon  for  a^x^fi,  or,  with  6n#  =  X,f(X)  will  be  a  polygon  for 
6wa  ^  JT  ^  6*7?,  i.e.  in  an  interval  of  length  bn(fi  —  a)  ^  1 ;  but  since  /  has  the 
period  1  it  will  be  polygonal  throughout.  Assuming,  then,  that  /  is  not  a 
polygon,  /  will  not  be  linear  in  any  interval. 

(4)  Now  let  xp  (p  =  1,2,  ...,r)  be  the  abscissae  of  the  vertices  of  g(x) 
mO<x^l,  and  0<Xl<x^< ...  <xr^  1.  (79) 
They  may  be  written  in  the  form 

where  pp,  q  are  integers,  q  a  common  denominator.  Then 


will  be  a  polygon,  and  the'abscissae  of  its  vertices  will  be  of  the  form  m/qbv 
and  can  all  be  written  with  the  common  denominator  qbn~l\  hence  /n(£) 
will  be  linear  hi  any  interval  of  the  form 

fc/te**-1)  ^  £  ^  (fc + iJ/te*"-1),  (si ) 

and  A(/n;g;a;,^)  =  0  *  (82) 

for  any  £,  x,  x'  in  this  interval. 
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(5)  If  f(x)  is  not  a  polygon  it  will,  by  (1)  and  (3),  possess,  for  every  one 
of  the  q  intervals 

(m  =  0,  1,  ...,<?-  1)  (83) 


a  constant  Dm>  0,  satisfying  the  lemma;  D  —  min(Z)0,Z)1,  ...,Dfl_i)  can  be 
used  simultaneously  in  all  q  intervals,  and,  because  of  the  periodicity  of/, 
for  all  intervals  (83)  with  arbitrary  integral  m.  Thus 

(84) 


for  any  £  and  suitable  x,  x'  in  (83). 

Passing  from  f(x)  to  rn(x)  =  anf(bnx),  we  see  that,  if  £  is  in  an  interval 
of  the  form  ml(qbn)  ^  £  ^  (m  +  b)/(qbn),  (85) 

then  A(rn;  £;  »,  a:')  ^  an6nZ)  ^  Z)  (86) 

for  suitable  x,  x'  in  (85).  Now,  for  every  n,  £  will  be  in  an  interval  (81),  and 
(81)  is  of  the  form  (85);  hence  (86)  can  be  satisfied  within  (81),  and  at  the 
same  time  (82)  will  hold,  so  that 

A(/;  £;*,*')=-*(»-,,;&  *,*')»#  (87) 

for  suitable  x,  x'  in  an  arbitrarily  small  neighbourhood  of  £,  which  proves 
that  /  nowhere  possesses  a  finite  derivative. 

(6)  Assume  now  that,  for  some  value  of  a,  6,  f(x)  is  a  polygon;  let  tff 
(cr  =  1,  2,  ...,«$)  be  the  abscissae  of  its  vertices  in  0<#<  1.  We  may  assume 

that  (7(0)  =  O.f  (88) 

Then  /(O)  =  0  (89) 

and  g(x)  =  hx      (0  <  x  ^  xj,  (90) 

f(x)  =  Hx     (0  <«<«!>.  (91) 

Now  g(x)=f(x)-af(bx).  (92) 

The  first  vertex  xt  ofg(x)  must  coincide  with  the  first  vertex  tjb  of  the  right- 
hand  side;  hence  b  =  ^^  <  ^^  (93) 


This  leaves  only  a  finite  number  of  values  for  6;  for  any  one  of  these  values 
we  have,  by  (90),  (91)  and  (92) 

hx  =  Hx-aHbx  (94) 

for  0 ^ x < xlt  whence  h  =  H(l-ab).  (95) 

f  If  0(0)  4= 0,  consider  O(x]  =  g(x)  -  g(0) ;  then  F(x)  =  2  avO(bvx)  =f(x)  -/(O). 
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Assuming  that  ab  >  1  (which  excludes  at  most  the  finite  number  of  cases 
a  —  6"1,  which  can  only  occur  when  h  =  0),  we  have 

*--  (96) 


(97) 

for  some  integral  k,  n.  Choose  the  smallest  n  for  which  there  is  a  k  satisfying 
(97):  then  n^l  since,  by  (88),  g(k/bn)  =  0  for  n^O.  Further,  choose  the 
smallest  positive  k  corresponding  to  this  n]  and  let  m  be  the  smallest 
integer  ^  n  for  which 

&/&«<*!.  (98) 


(If 


(ioo) 


(100)  is  not  satisfied  identically  because  the  coefficient  of  am~n+1}  namely 
—  bg(k/bn),  is  not  0;  hence  there  are  at  most  m  —  n  +  l  values  of  a  satisfying 
(100).  Adding  the  possible  value  a  «=  1/6,  we  see  that  for  each  b  there  are 
at  most  m  —  n  +  2  values  of  a  for  which  f(x)  can  become  a  polygon. 

This  estimate  is  rather  crude  and  could  certainly  be  improved  by  finer 
considerations.  But  it  should  be  noted  that  the  occurrence  of  exceptional 
values,  for  which  f(x)  is  polygonal,  cannot  be  ruled  out  altogether;  in  fact, 
every  f(x)  can  be  represented  as  J^avg(bvx)  for  any  given  a,  6;  one  has  simply 
to  define  g(x)  by  (92),  and  if  /(#)  is  polygonal,  g(x)  automatically  becomes 
polygonal.  But  in  certain  cases  the  existence  of  exceptional  values  can  easily 
be  disproved.  If,  for  example,  g(x)  ^  0,  (/(O)  =  0,  and  h  >  0,  the  case  a  —  1/6 
cannot  occur,  f  and  (99)  cannot  be  satisfied  by  any  a  between  0  and  1 
because  its  two  sides  are  of  different  sign. 

As  examples  consider: 

(1)  g(x)  =  ^(a;);}  we  have  ^(0)  =  0,  h  =  1  >  0,  and  i/r(x)  ^  0.  Hence 

f(x)  =  Z 


nowhere  possesses  a  finite  derivative  for  integral  6  >  1  and  ab  ^  1  .  This  result 
includes  the  examples  given  by  van  der  Waerden(9),  Landau  (12)  and 
Hildebrandt(ll).  The  same  result  holds  for  <j)(x).% 

t   See  the  remark  after  (95). 

$  As  defined  in  the  first  footnote  of  the  paper. 
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(2)  g(x)  —  x(x)'y"\  consider  the  corresponding  function  with  period  1: 

x  for  0  ^  x  <  J, 

\-x         f 

-l+x     f 


Here  ajx  =  J;  hence,  for  exceptional  b,  6  <  4,  g(0)  =  0,  h  =  1,  and  hence 
ab>l. 

Now  for  6  =  2,  g(\k)  =  0,  0(J)  =  J^O,  and  J<  J  =  2^  =  ^;  and  hence, 
with  &  =  1,  m  =  7i  =  2,  (100)  becomes 

(l-a6)<7(J)-i  -  (i-a&)i-i  =  0 
with  the  only  solution  a  =  0.    And  for  6=3, 
Now  if  A;  =  1,  m  =  w-  =  1,  (100)  becomes 


with  the  only  solution  a  =  —  ^.  Hence 

/(*)  =  S  «' 

v  =  0 

nowhere  possesses  a  finite  derivative  for  integral  6  >  1,  and  a6  >  1. 
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NOTE  ON  CONVERGENCE  AND  SUMMABILITY 
FACTORS  (III) 

By  L.  S.  BOSANQUET 

[Received  and  read  14  November  1946] 

1 .    Introduction 

In  recent  notes*  I  have  discussed  the  problem  of  finding  necessary  and 
sufficient  conditions  to  be  satisfied  by  a  sequence  {ew}  in  order  that  the  series  f 
%anen  may  be  summable  (C,fi)  whenever  Sa^  is  summable  (C,  a).  In  the 
case  where  a  and  j$  are  integers,  necessary  and  sufficient  conditions  have 
been  formulated  without  proof  by  Schur,  J  and  in  (7)  and  (8)  I  have  given 
proofs  of  Schur's  theorem,  and  an  extension  to  series  whose  partial  Cesaro 
means  are  o(nP),  where  p  is  any  real  number.  In  the  case  where  a  =  /?  =  /c, 
and  K  is  an  integer,  the  sufficiency  part  of  the  theorem  is  the  Bohr-Hardy 
theorem,  found  independently  by  Bohr§  and  Hardy,  ||  and  the  necessity 
part  is  a  result  of  Fekete.^f  In  the  case  where  ft  —  0  and  oc  is  an  integer  the 
sufficiency  part  is  due  to  Bromwich,**  and  the  necessity  to  Kojima.ff  Brom- 
wich's  theorem  was  extended  by  Chapman  JJ  to  the  case  where  a  is  fractional, 
and  Andersen§§  extended  the  Bohr-Hardy  theorem  similarly,  stating  the 
corresponding  result  with  a  parameter  p  ^  0.  I  have  completed  Andersen's 
result,  in  necessary  and  sufficient  form,  in  (6).  Here  I  give  a  solution  of  the 
general  problem,  where  a  and  ft  may  be  fractional.  The  form  of  the  con- 
ditions is  the  same  as  in  the  integral  case,  but  the  presence  of  a  fractional 
difference  gives  the  result  a  different  character.  It  is  convenient  at  the  same 
time  to  consider  the  case  with  a  parameter  p  ^  0,  but  the  case  where  p  <  0 
is  not  discussed,  nor  is  the  case  where  a  or  /?  is  negative. 


*  Bosanquet  (6,  7,  8).         f  Here  and  elsewhere  S  denotes  S. 

0 

%  Schur  (14).  §  Bohr  (3,  4).  ||   Hardy  (12). 

f  Fekete  (11).  **  Bromwich  (9).  ft  Kojima  (13). 

%%  Chapman  (10).  §§  Andersen  (1,  2). 
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2.   Notation 
We  write  8*(sv)  =  8*  =  S  ^Hfo     (a  real),  (1) 

y«0 

where  A%  is  given  by  the  identity 

(l-s)--i  =  S4S*n     (|  *|<1).  (2) 

Then  it  is  familiar  that* 

as+f-SXSf)     (a,  A  real),  (3) 

A'+f  =  8l(Ag),  (4) 


i.  -».-)•  w 

We  also  write  A«en  =  S  A~'-*e,  (7) 

v=n 

whenever  this  series  converges.  f  In  particular 


and,  if  en  =  O(l),  Aaen  exists  for  every  a  ^  0. 

When  discussing  a  series  San  we  shall  always  take 


(8) 


«„  =  -Si(«F)  =  S  «„  (») 

v=0 

so  that  »n  =  S^(«F)  =  f*--'-1     ^^  (10) 

I       5o  (n  =*  u;. 

A  series  Saw  can  always  be  chosen  so  that  S%  is  any  given  sequence.  For, 
by(a),St  =  unifon  =  S;-1(u,). 

3.    Preliminary  lemmas 
We  require  the  following  lemmas. 

LEMMA  1.  // a  +p>  -  1,  £ >0  ar^  ^*  =  o(7ia+^),  then  S%+s  =  o(n«+*>+s). 
The  same  holds  with  O  in  place  of  o. 


*   It  is  to  be  understood  here  and  elsewhere  that  S^(uv)  =   £  An_*uv,  where  {uv} 
ay  be  any  sequence,  while  S%  stands  only  for  S%(sv).  V=S>Q 

f  This  implies  en  =  o(na+1),  by  (6),  unless  a  =  0,  1,  .... 

FF2 
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By  (3)  and  (6) 


LEMMA  2.  Ifa+p  >  -  1,  £>  0  araZ  ££  =  o(na+^),  «Aen  ££-*  =  o(»a+*). 
By  (3)  and  (6) 


LEMMA  3.  Ifen  =  O(l), 


/or  a^O,  /?>  -1,  a 

If  en  —  o(l)  the  last  two  conditions  may  be  replaced  by  p^  —  1,  a 
This  is  due  to  Andersen.* 

LEMMA  4.  I/  a  ^  0,  p  ^  0  cmd 

(i)  ew  -  0(w-»),     (ii)  2?ia+^  |  A-+X  |  <  oo,  (12) 


(13) 

for  -!</?<  a, 

(6)  A%n  =  o(n-fi-»)  (14) 

/or  0  <  p  ^  a. 

This  is  due  to  Andersen.  f 

LEMMA  5.  J  If  O^S^l  and  Q^m<n,  then 

S«|.  (15) 


The  result  being  trivial  when  #  =  0  or  1,  we  suppose  that  0<  &<  1.  By 
repeated  applications  of  Abel's  Lemma,§  since 


*  Andersen  (1,  2).  A  proof  is  given  in  Bosanquet  (6). 

t  Andersen  (1,  2).  A  proof  is  given  in  Bosanquet  (6).  In  (13)  A%f*  may  be  replaced 
by   (n-hl)*+p  by  (6),  except  when  /?  =  —  1,  p  =  0,  as   is  shown  by  the  example 


|  Bosanquet  (5).   Here  we  give  an  alternative  proof. 
§  If    max  |  a0  +  al  +  ...  +  av  \  =  Mm  and  d^d^  ...Z*dm^Q,  then 
0<p<m 

j  +  .  .  .  +  dmam 
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for  0  <  v  <  m  <  n,  there  exist  integers  mv  such  that  m  ^  mx  ^  ma  ^  . . .  ^  0,  and 


S  A>-_\8 


m    J-5-1 

y      n~"  ^l1 
A*-* 


8-1  £8- 


'-1    m*+1 


Now  since  mj^j^ig, ...  is  a  non-increasing  sequence  of  non-negative  in- 
tegers, there  is  an  integer  p  such  that  mp  =  mp+li.  Therefore,  since 


max   \S*\* 


LEMMA  6.  //  {dmin}  is  a  given  double  sequence,  then  necessary  and  sufficient 

00 

conditions  that  the  series  £  smdm>n  should  converge  and  its  sum  tend  to  a  limit 

m  =  0 

as  ra.->oo,  whenever  sn  —  o(l),  are  that 

CO 

(i)    S  |  ^m,n  I  ^  -^"i  where  M  is  independent  of  n,  and 

m~0 

(ii)   lim  eZm>n  should  exist  for  m  =  0,  1,  .... 

71->QO 

//o  is  replaced  by  O,  then  condition  (i)  must  be  replaced  by  the  condition  that 

QO 

(m)  S   |  ^m,n  I  should  converge  uniformly  with  respect  to  n. 

m  —  O 

This  is  due  essentially  to  Schur.f 

*  Incidentally  we  note  that,  if  0<d<  1,  then  A^~1/A^~l<  1,  and  hence  there  is 
strict  inequality  in  (15)  unless  S  J  =  0  for  0^/*<m,  i.e.  unless  s0  —  st  =  ...  =  «m  =  0. 

f  Schur  (14).  Instead  of  the  class  of  nul  sequences  {sn}  Schur  considered  the  class 
of  convergent  sequences  {sn}.  In  the  latter  case  a  further  condition  must  be  added 

CO 

to  (i)  and  (ii),  namely,  (iv)  that  lim    S  dm>n  should  exist.   I  am  indebted  to  Mr  C.  A. 

n—  ><&  m=0 

Rogers  for  pointing  out  that  this  condition  should  not  be  included  in  the  lemma  as 
given  here.   In  his  proofs  that  (i)  and  (ii)  are  necessary  Schur  used  nul  sequences. 
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4.    The  main  lemma 

Before  giving  the  main  result  we  prove  one  more  lemma. 
LEMMA  7.  //0</?<a,  #^0,  #^0,  q<p+  1  and 

(i)  S%  =  o(n«+*>),  (16) 

(ii)  en  =  0(^-«-«),  (17) 

(iii)  £  ™a+s  |  Aa+1en  |  <  oo,  (18) 

«Ae»  snen  =  o(^-*)     (C,fi).  (19) 

T^e  conclusion  also  holds  if  o  and  0  are  interchanged  in  the  hypotheses.* 
Proof.  We  consider  only  the  first  of  the  alternative  sets  of  hypotheses. 
The  proof  in  the  other  case  is  similar. 

Case  1.  Suppose  that  /?  =  0.  Then  by  (16),  (17)  and  lemma  2, 

sn  =  S*,  =  o(na+*>)     and     en  =  O(n-*-*). 
Hence  snen  =  o(^a+^)  O(n~a-«)  =  O(T^-«). 

Case  2.  Suppose  that  0<fi^a<l.  Then  by  (1)  and  (3),  since 
sn  =  Si  =  S~"(Sf), 


A—  a—I 
-"-V—IL 


/i-O 

where,  by  partial  summation, 


*  In  the  case  where  a  and  /?  are  both  integers  this  result  has  been  proved  in  (8), 
with  condition  (iii)  replaced  by  the  weaker  condition 

(iii)'     Aaew  =  O(n~a~9)t 
and  with  the  conditions  p  ^  0,  q  ^  0  removed. 
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Now,  by  (4), 

A  -  e^  So4k}_,4  J"-1  -  en+lAfc«~i.  (20) 

Hence,if/?  =  a,  4  -  f  ^ 

^  I   0 

and  I^JA  =  S£e 

=  o(rc« 
If  0</?<a,  by  (17),  (6)  and  (20), 

/x 
and,  since  a  +p  ^  0, 


Next  /2  =  S  Aev  J, 

v«/t 

where  J  =  S  ^^Z^-f-1.  (21) 

P=*P 

We  show  now  that, 


r-v  (22) 

where  ^  is  independent  of  /£,  i^,  w. 

We  first  observe  that  A^a~l  =  1  >  0  and,  since  0  <  a  <  1, 

A    .-,.-«(-«+!)...(-«+/.-  D<0     {/t>1).  (23) 

1  ,  ^  .....  ^£ 

Also  ^l^"1  =  1  >  0,  and,  since  0<fl<I, 

0<Af-l  =  ^f^A'zl<Af2\     (p>l).  (24) 

/* 

It  foUows,*  by  (23),  (24)  and  (4),  that 


/^-l  I   J-a-1  I        J^"1         1x4  -a-1  I  Afi-1  I 

n-^  I  ^0  I  ~~^w-/t-l  I  -o-l  I  ~~  •  •  •  ~~  ^n-v  I 


v—a 

x|/*-l    V    /l- 
-n-n—u  2j  -^-o- 

(T  — 0 

40—  1  4— « 

•a-n—u'*1v—iif 


*  Cf.  Andersen  (2),  70,  for  a  similar  argument. 
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On  the  other  hand,  by  (23),  (24),  (4)  and  (6),  if  0  ^/*  <  v  ^  n, 

v  n 

7  —   V    A0-1  A—  «—  1>  V    Aft—  l  A—*-1 
J  —   2j  ^-n-^--t     &  2j  ^-n-^--t 


Thus  (22)  holds,  with  #  =  max  (1,  K2). 

We  now  have,*  by  (22),  (7),  (4),  (5)  and  (6),  since  Aew  =  A~a(Aa+1en 
by  lemma  3, 


V    Aa~  1  A—  * 

Li  ^-p-v^-v-fj. 


I  Aa+1e 

I  ^       bp 

I  Aa+1e 

a       c./> 


since  a  +  <?  ^  0.  It  follows  that 

S  ^2A  =  S  o(/ 
A=o  /*=o 

=  S  O 


since  /?  >  0  and  p  —  q>  —  1. 


*  Cf.  Andersen  (2),  lemma  4. 
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Thus,  collecting  our  results,  we  have,  in  case  2, 


Case  3.   Suppose  that  0  <  ft  <  1  ^  a,  and  assume  the  lemma  with  a,  /?,  p,  q 
replaced  by  a—  1  ,  T,  p+l,q+l,  where  r  =  min  (/?,  a  —  1  ). 
We  have,  by  partial  summation, 

S  a.e,  =  S*en+1  +  S  SJAe,,  (25) 

v=0  i>  =  0 

i.e.  SJ^e,)  =  ^6n+1  +  ^(^}  AO,  (26) 

and  hence,  by  (3), 


Thus,  by  (3),        Si(svev)  =  S^(Slev+1)  +  ^(^Ae,).  (27) 

Now,  by  (16)  and  lemma  2,  since  a^l,  £*  =  o(na+p).  Also,  by  (17), 


n+1  =      7).  Hence 


It  then  follows,  by  lemma  2,  since  /?—  1  <  0  and  fl+p  —  q>  —  1,  that 

^(SJiWi)  =  <>(*/+"-«).  (28) 

We  next  show  that  S^,  Aen  satisfy  the  hypotheses  of  sn,  en  in  the  case 
assumed.  In  the  first  place,  by  (16)  and  (3), 


.  (29) 

Next,  by  (  1  7  ),  Aen  =  en  -  en-1 


On  the  other  hand,  by  (17),  (18)  and  lemma  4(6),  since 


we  have  Aen  =  ^(^-s-1)  =  0{7i(«-D-<<*-i>-«z+i>}. 

Thus  Aew  =  O^^-^^+W},  (30) 

where  r  —  min  (/?,  a  —  1  ). 

Also,  by  (18)  and  lemma  3,  since  en  =  0(1), 

=  S^a+«  |  A^^  |  <  oo.  (31) 
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Thus  /S^,  Aen  satisfy  the  hypotheses  of  sn,  en  in  the  lemma,  with  a,  /?,  p,  q 
replaced  by  a—  ltT9p+l,q+l,  and  it  follows  from  our  assumption  that 

Si  Aen  =  o^+iMa+D}     (  <7,  r),  (32) 

and  hence,  by  lemma  1,  that 

S*n&en  =  o(n?-«)     (<?,/?),  (33) 

i.e.  ££(£JAe,)  =  o(^+*-«).  (34) 

We  therefore  have,  by  (28),  (34)  and  (27), 


and  case  3  follows  by  induction  from  cases  1  and  2. 

Case  4.    Suppose  that  1  ^  /?  ^  a,  and  assume  the  lemma  with  a,  /?,  #>,  # 
replaced  by  a—  1,  /?—  1,  p+  1,  <?,  or  by  a—  1,  r,  p+  1,  q+  1,  where 

T  =  min(/?,a—  1). 

We  again  use  (27),  and  we  begin  by  showing  that  Sfa,  en+l  satisfy  the 
hypotheses  of  sn,  en  in  the  first  case  assumed. 
We  have,  by  (16),  (17)  and  (3), 

8t~l(8i)  =  S%  =  o{^a-1>+<^+1>},  (35) 

en  =  Ofwtf-iMa-D-a},  (36) 

and,  by  (17),  (18),  (6)  and  lemma  4  (a),  since  en  =  O(^~a~«)  =  O(-«), 


(»  +  l)(a-1)+«  I  A<«-1>+1en+1  1  =  S  w—i+fl  |  Aaen  I  <  oo.  (37) 

o  i 

Thus  /8^,  eM+1  satisfy  the  hypotheses  of  sn)  en  with  a,  /?,  jp,  g  replaced  by 
a—  I,/?—  1,2>+1,  £,  and  it  follows  from  the  first  case  assumed  that 

(38) 
(39) 

Again,  as  in  case  3,  $^,  Aen  satisfy  the  hypotheses  of  sn,  en  with  a,  y^,  p,  q 
replaced  by  a—  1,  r,  p+l,  q+1,  where  r  =  min(/?,  a—  1),  and  it  follows 
from  the  second  case  assumed  that 


Sln  Aen  -  o{rcCP+i>-<<z+i>}     (  (7,  T),  (40) 

and  hence,  by  lemma  1,  that 

,  (41) 

(42) 
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Thus  it  follows,  by  (39),  (42)  and  (27),  that 


and  case  4  follows  by  induction  from  cases  1,  2  and  3. 
This  completes  the  proof  of  the  lemma. 

5.    The  main  result 


THEOREM  A.  //  0^/?^a,  ^p^O,  then  necessary  and  sufficient  conditions 
that  %anen  should  be  summable  (C,fl)  whenever  S%  =  o(n*+p)  are 


(i)  en  =.  O(7^-a-^),     (ii)  27ia+*>  |  A^^  |  <  oo.  (43) 

//  J3  >  a  ^  0,  p  ^  0,  the  conditions  are 

(i)'  en  «  O(n-*>),     (ii)  Sw«+»  |  &"+len  |  <oo.  (44) 


i/  0  ara£  o  are  interchanged  throughout.* 
Proof.  We  shall  consider  only  the  case  where  8%  =  o(na+I)).  The  alter- 
native theorem  is  proved  by  a  similar  argument. 

Sufficiency.  Suppose  that  Q^fi^a,  that  (i)  and  (ii)  hold,  and  that 


Write                                  avev  =  *nen+1-f  S  *v^.  (45) 

p«0  v-O 

Now                                           >S«  =  o(n«+v),  (46) 

en+i  =  0(n>—  »)  (47) 

and                                        Sw«+»  |  Aa-*-%n+1  1  <  oo.  (48) 
Hence,  by  lemma  7,  with  q  =  p, 

<VW  =  °(1)     <£./?)•  (49) 

We  shall  show  that  the  series  £#„  Ae,,  is  convergent  if  0  <  a  <  1,  and  sum- 
mable (C,  T),  where  T  =  min  (/?,  a—  1),  if  a  >  1. 


*  There  is  also  a  version  where  the  condition  that  S%  =  o(na+  *)  is  replaced  by 
££  ~  «^L*H'1'  /or  5o?n6  5.  The  necessity  part  is  then  included  in  theorem  A.  To  prove 
the  sufficiency  part  we  may  subtract  sA%"*  from  an  and  s^A^~len  from  Sanen, 
and  so  deduce  the  result  from  theorem  A.  The  convergence  of  Su4^~1en  follows 
from  conditions  (i)  [or  (i)7]  and  (ii)  by  lemma  4  (a). 
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Case  1  .  Suppose  that  0  <  /?  <  a  <  I  . 

We  write,  since  Ae,,  =  A~a(Aa+1e,,),  by  lemma  3, 

S  *v&ev  =  S  5,  S  ^IJA^ 

v  =  0  v  =  0     ^=v 

ro       ?i  w          oo 

-  2   S  +  S     S 


Then 


=  S  A^e^  f  ^«=}«, 


S  ^J 

* 


->  S  fifjA^e  , 

^-o    *          * 

as  ri->oo,  since  S  |  fifjA^^  |  <^S(/^+  l)a+p  |  Aa+1e/{  |  <oo,  by  (ii). 
Also  J2  =  S  *p    S    ^?Z 


=    S  A-* 

/*=w+l 

and  hence,  by  lemma  5,  since  a 


=  0(1). 

Thus  S«v  Aev  converges,  and  so  is  summable  (C,  ft),  by  lemma  1  .  It  follows 
from  (45)  that  %anen  is  summable  (C,p)  in  case  1. 

Case  2.  Suppose  that  0</?<a,  a  >  1,  an^  assume  the  theorem  with  a,  /?,  _p 
replaced  by  a  —  1  ,  r,  ^>  +  1  ,  where  r  —  min  (/?,  a  —  1  ). 

We  have,  as  in  the  proof  of  lemma  7  (cases  3  and  4), 

(50) 
(51) 
|  <  oo,  (52) 
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where  r  =  min  (/?,<%—!),  i.e.  $*,  Aen  satisfy  the  hypotheses  of  sn,  en  in  the 
theorem,  with  a,  /?,  p  replaced  by  a  -  1  ,  r,  p  +  1  .  It  follows  from  our  assump- 
tion that  SsnAen  is  summable  (O,r),  and  hence,  by  lemma  1,  is  summable 
(C,fi).  The  summability  (<?,/?)  of  SanAen  is  thus  proved  by  induction  in 
case  2,  and  it  again  follows  from  (45)  that  SaMew  is  summable  (O,/?). 

This  completes  the  proof  of  the  sufficiency  when  0  ^  ft  ^  a. 

Suppose  that  /?>a^0.  Then  conditions  (i)'  and  (ii)  are  the  same  as  con- 
ditions (i)  and  (ii),  with  ft  =  a,  and  hence  %anen  is  summable  (C,  a)  whenever 
$J  =  o(na+v).  It  follows  from  lemma  1  that  1^anen  is  summable  (C,fl). 

This  completes  the  proof  of  the  sufficiency  part  of  the  theorem. 

Necessity.  We  first  show  that  condition  (i)'  is  necessary.*  We  observe 
that,  if  %anen  is  summable  (C)  whenever  $£  =  o(rc,a+:p),  then  in  particular, 
by  lemma  1,  it  is  summable  (C)  whenever  sn  —  o(nP). 

Now  suppose,  if  possible,  that  limn^  |  en  \  —  +  oo.  Then  there  are  positive 
integers  ny  such  that  n-L<n2<  ...,  and  n%  \  enp  \  >  v3  (v  ^  1).  Choose  an  so  that 


Then,  if  ^?  >  0  and  n[l^n<  TZ^+X,  we  have 


while,  if  39  =  0,  we  can  make  sn  =  o(l)  by  altering  a0  suitably.  But,  for  v  ^  1, 


and  anew  =  0  (w=j=wv).t  Hence  2anen  =  4-oo,  which  contradicts  the  sum- 
mability (C)  of  Zarten.  Thus  (i)'  is  necessary. 
Next  we  have,  by  (1),  (3)  and  (10), 


in 

—  —  —    Y   /l^       G 

jfl    Zj  ^n—  v^v 

f 


. 
-    V          JL- 

a+V 


*  When  0^/5^  a  condition  (i)'  is  included  in  condition  (i). 
t  Except  possibly  for  n  =  0,  if  p  =  0. 
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where  *„  =  S  A^A-*^et.  (53) 


It  follows  that,  if  2anen  is  summable  (£,/?)  whenever  S^jA^  =  o(l), 
there  is,  by  lemma  6,  a  number  M,  independent  of  n,  such  that 

(64) 


where  d^n  is  given  by  (53). 

It  follows  from  (54),  in  particular,  that 


(55) 
(56) 

and  hence,  by  (6),  en  =  0(n^~a-~p).  (57) 

Thus  condition  (i)  is  necessary.* 

OO 

Next  since  en  =  O(l),  by  (i)  or  (i)',  the  series   2  A7-p*Gv  converges,  and 
so,  by  lemma  1,  is  summable  (<7,/?).f  Hence 

1        n  oo 

(58) 


as  n  ->  oo,  and  therefore,  for  each  /*  ^  0, 


lim  dfin  —  Ap+pAa+le  .J  (59) 

W->oo 


*  When  /?>a^0  condition  (i)  is  included  in  condition  (i)7. 

We  may  also  prove  the  necessity  of  condition  (i)  without  using  lemma  6.  For 
suppose  if  possible  that  lim  |  n~^+a+3)en  |  =  +  oo.  Then  there  is  an  increasing  sequence 
of  positive  integers  nv  such  that 


nvAV-i-*-+°°     and     n~0+a+v  |  env  \  >  v 
Now  choose  an  so  that  #«   =  v-ln«+p,  S*  =  0  (n^nv).  Then  S%  =  o(na+»),  and 


an  =  S~  = 


Hence  |  n~f>anvcnv  \  -  v-in-^+3>  i  e«v  |  >  1, 

and  so  anen  is  not  o(n#).  This,  by  lemma  2,  contradicts  the  summability  (G,  ft] 
of  Lanen. 

f  We  may  consider  the  series  as  starting  at  the  term  for  which  v  =  0,  the  first 
ft  terms  being  zero. 

J  This  is  condition  (ii)  of  lemma  6. 
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It  follows  from  (54)  that,  for  each  m  >  0, 

S   lim|dA.|<Jf,  (60) 

/t«On-»-oo 
oo 

and  hence  £    lim  jd^  |  ^  Jlf,  (61) 

/»"=0  tt~voo 

i.e.  S  ^J+P  I  A<X+1 

X=o 

which,  by  (6),  shows  that  (ii)  is  necessary. 

This  completes  the  proof  of  the  necessity  part  of  the  theorem. 

6.   Additional  remarks 

I  mentioned  in  the  introduction  that  theorem  A  (with  p  =  0)  was 
formulated  by  Schur  in  the  case  where  a  and  /?  are  integers.  It  is  natural  to 
inquire  how  he  proved  the  theorem. 

He  observed  in  effect*  that 

2  Ai_farer  =  S  S«A«+Vfi-,«,).  (63) 

j/  =  0  v=0 

where  the  broken  difference  t^ev  is  obtained  from  the  difference  AAe,,  by 
putting  e^  =  0  for  /*  =  T&+  1,  w+  2,  ...,  and  he  stated  that  the  result  could 
then  be  obtained  by  using  lemma  6  (above).  So  far  as  the  necessity  of  con- 
ditions (i)  and  (ii)  is  concerned  his  argument  was  probably  much  the  same 
as  the  one  I  have  given  above,  while  the  necessity  of  (i)'  may  be  established 
in  various  ways.f  Concerning  the  sufficiency  Prof.  Hardy  has  recently 
shown  me  how  the  Bohr-Hardy  theorem  may  be  deduced  from  (63)  by  means 
of  lemma  6,  and  I  have  verified  that  his  method  may  be  adapted  to  the 
case  where  a  and  ft  are  unequal  integers,  and  there  is  a  parameter  p.  His 
method  shows  that,  if  a  and  ft  are  integers  and  p  is  real,  conditions  (i)  and 
(ii)  of  theorem  A  imply  that 


where  M  is  independent  of  n.  This  is  condition  (i)  of  lemma  6,  with 
given  by  (53),  while  condition  (ii)  is  trivial. 


*  I  use  the  notation  of  Andersen  (1,2),  and  of  the  present  paper. 
|  Cf.  Bosanquet  (7,  8).  When  ex.  is  an  integer,  the  series  denning  Aa+1en  terminates 
and,  if  a  parameter  p  is  inserted  in  the  theorem,  p  may  be  any  real  number. 
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Still  another  way  of  establishing  the  sufficiency,  when  a  and  fl  are  integers 
and  p  is  real,  is  to  adapt  Hardy's  original  method  of  proof  of  the  Bohr-Hardy 
theorem,  starting  from  (63).  This  is  easily  carried  out  if  we  follow  Ander- 
sen's* compact  proof  of  the  Bohr-Hardy  theorem,  making  suitable 
modifications. 
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Introduction 

Let  r  be  an  integer  at  least  equal  to  2.  A  natural  number  will  be  called 
an  r -number  if  it  is  divisible  by  the  rth  power  of  some  prime;  otherwise  it 
will  be  called  an  r-free  number.^  It  is  well  knownj  that  the  number  of  r-free 
numbers  not  exceeding  x  is  ^^  +  Q^1/r)  ( l  ^ 

as  x->ao,  and  it  is  natural  to  investigate  the  frequency  of  occurrence  not 
only  of  r-free  numbers  but  also  of  systems  of  r-free  numbers.  This  problem 
was  studied  (for  r  =  2)  by  S.  S.  Pillai§  who  established  an  asymptotic  for- 
mula, with  an  error  term  0(#/log  x),  for  the  number  N(x)  =  N(x;  dlt . . .,  dr_^ 
of  systems  of  2-free  numbers  q±,  q^  +  d-^  ...,q1  +  dr_±  not  exceeding  x. 

We  shall  here  be  concerned  with  a  still  more  general  question,  namely  the 
frequency  of  occurrence  of  certain  patterns  of  numbers  whose  character 
with  regard  to  divisibility  by  rth  powers  is  prescribed.  Let  als . . . ,  e^;  6lt . . . ,  6m 
be  any  distinct  positive  integers. ||  We  denote  by  H(x)  —  Hr(x;  c^,  ...,0^; 
blt...,bm)  the  number  of  systems  of  positive  integers  n-\-a^  ...,?i-faz; 
n  4-  &!, . . . ,  n  +  bm,  not  exceeding  x,  and  such  that  the  first  Z  of  these  numbers 
are  r-free  whilst  the  remaining  ra  are  not. 

*  This  paper  includes  some  material  from  the  author's  M.Sc.  thesis  (London, 
1941). 

f  The  customary  term  for  2-free  is  square-free  or  quadratfrei. 

$  For  the  case  r  =  2,  see,  for  example,  E.  Landau,  Handbuch  der  Lehre  von  der 
Verteilung  der  Primzahlen  (Leipzig  and  Berlin,  1909),  2,  580-582.  The  general  case 
can  be  dealt  with  in  precisely  the  same  way. 

§  "On  sets  of  square-free  integers",  Journal  Indian  Math.  Soc.,  N.S.,  2  (1936), 
116-118. 

||  We  assume  that  I  +  m  >  0,  but  the  cases  I  >  0,  m  —  0  and  I  —  0,  m  >  0  are  admitted. 
In  the  former  case,  for  instance,  the  system  of  numbers  ox,  ...,aj;  bit  . ..,  bm  is 
interpreted  as  ax,  . ..,  at. 

SBB.  2.     VOL.  50.     NO.  2410.  GG 
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Our  principal  result  (theorem  2)  is  the  asymptotic  formula 
H(x)  =  hx+  O(x«)     (x^ao), 

where  a  =  (I  +  m)/(r  +  l  +  m—  1)  +  e,  and,  except  in  certain  trivial  cases  when 
H(x)  vanishes  identically,  h  =  hr(al9  ...,«z;  bl9  ...,&m)  is  a  positive  constant 
whose  value  will  be  determined.  The  proof  of  this  formula  will  depend 
upon  a  generalization  of  Pillai's  result  in  a  different  direction  (theorem  1). 

In  the  final  section  we  consider  an  application  of  theorem  2  to  the  study 
of  frequency  of  systems  of  consecutive  r-free  numbers  and  r-numbers. 

I  wish  to  thank  Dr  T.  Estermann  for  pointing  out  to  me  that  a  method 
used  by  him  in  estimating  certain  sums*  was  also  applicable  in  the  present 
problem.  Dr  Estermann's  remark  enabled  me  to  improve  considerably  on 
my  previous  results.  I  am  also  under  a  great  obligation  to  Dr  R.  Rado  who 
read  the  manuscript  and  made  many  valuable  suggestions. 

Notation 

If  n  is  any  variable  and  I\(n),  P%(n)  are  two  propositions  concerning  this 
variable,  then  r>  /  \ 


means  that  for  every  n  for  which  P%(n)  holds,  P^n)  holds  also; 


means  that  for  some  n  for  which  P%(n)  holds,  P^n)  holds  also. 
For  instance,  —  <  1     (n^  1), 


Throughout  x  denotes  a  positive  real  number;  e  denotes  an  arbitrarily 
small  positive  number;  p,plt  etc.,  denote  prime  numbers. 

The  O-notation  refers  to  the  passage  #->oo,  and  the  constants  implied 
by  it  may  depend  upon  parameters  other  than  x. 

(nlt  ...,^s)  and  {%>  ~'>n8}  denote  respectively  the  highest  common  factor 
and  the  least  common  multiple  of  %,  ...,ns. 

n  ^  %,...,  na  means  n  ^  nt  (  1  ^  i  ^  s). 

d(n)  is  the  number  of  positive  divisors  ofn. 

is  Mobius's  function  ;  ju,r(n)  =  1  or  0  according  as  n  is  or  is  not  r-free. 


*  See  T.  Estermann,  "On  the  representation  of  a  number  as  the  sum  of  two 
numbers  not  divisible  by  fcth  powers",  J.  London  Math.  Soc.  6  (1931),  37-40. 
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Let  CD  be  a  function  having  for  its  argument  any  finite  system  of  integers. 
For  simplicity  we  shall  denote  such  a  function  by,  say,  O^,  ...>n8)  where 
it  is  understood  that  s  may  vary.  Then,  for  k  =  0,  the  symbol 

S          0(A15  ...,A,,i'1,  ...,vk) 

I^v1<...<vk<m 

is  interpreted  as  <£(A1}  .  .  .,  A,). 
We  write 


Mr(x;cl9...,c8)  =  Mr(x\  I;G!,  ...,cs). 

For  <r  >  0,  Z)(cr  |  n-j,  .  .  .  ,  ns)  denotes  the  number  of  different  residue  classes 
(modcr)  represented  by  n^..,,n3,  i.e.  the  number  of  integers  v  satisfying 
the  conditions  ,  .  . 

1  <  V  ^  (7, 


V  =  T^  (modcr) 

More  generally  we  define,  for  cr,  r>0,  D(<r,r  |  %,  ...,ng)  as  the  number 
of  integers  v  satisfying 

(<r,r)\v, 

i^  =  %  (modo")     [l<*^<s]. 
Clearly  D(cr,  r  \  n±)  .  ..,  ns)  is  also  equal  to  the  number  of  integers  v'  satisfying 


(CT,T)V'S^  (modcr) 
Furthermore,  0  =^  D(cr,  r  |  %,...,  tis)  ^  (<r,  r)"1^, 

and  D(<r,  1  |  %,...,  ns)  =  Z>(o*  |  n^  .  .  .  ,  na). 

If  Ax,  .  .  .  ,  Aa  are  integers  and  TZ^,  .  .  .  ,  ns  are  positive  integers,  then 


according  as  the  system  of  congruences  in  g,    ' 

g  +  A,  =  0(mod^)     (l^v^s)  (2) 

is  or  is  not  soluble. 

Lemmas 
LEMMA  1.  The  system  of  congruences  (2)  is  soluble  if  and  only  if 

(nitnj)  |  (A,. -A,)     (l^i<j^s). 

GG2 
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In  the  case  of  solubility  the  solutions  form  exactly  one  residue  class 

This  result  is  well  known.* 

LEMMA  2.  Let  T  be  the  number  of  positive  integers  g^x  satisfying  (2).  Then 

j 
T-x- 


This  follows  easily  by  lemma  1  . 

LEMMA  3.  Let  %,  ...,nst  clt  ...,ca,  #  6e 
number  of  systems  of  positive  integers  £,  fx,  . 


Then 


positive  integers,  and  T  the. 


E   *,«!»-»' 

T-x     \°-*i--< 


Proof.  T  is  clearly  equal  to  the  number  of  positive  integers  g  such  that 


Hence  it  is  equal  to  the  number  of  positive  integers  q  such  that 


and  the  required  result  follows  by  lemma  2. 

LEMMA  4.  Letf(nl9  ...,ns)  be  a  function  of  s  integral  variables. 
If(i)        2       \f(n1,...,nt)\<oo, 

HI,  ...,n,»>l 

(ii)  /(%,  .  .  .,  ?&5)  is  multiplicative  in  nt,  .  .  .,  ?&s,  i.e. 
/(«!,  .  .  .  ,  nj  /(nj,  .  .  .  ,  n'8)  =  /(%  wj,  .  .  .  ,  wX) 


where 


See,  for  example,  A.  Scholz,  Einfuhrung  in  die  Zahlentheorie  (Gdschen),  Satz  31. 
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The  case  s  =  1  is  Euler's  identity,  and  the  generalization  to  any  s  >  1  is 
immediately  obvious. 

LEMMA  5.  Let  t,q,r,  p,klt  ...,  kt  be  given  integers  ;t>l,gr>l,r>2,  and  let 
pbea  given  prime.  * 

//  1^  =  .  .  .  =  kt  =p  (modpr)t  then 


*'*1*'  ...,  ^<\(-l     if(pr,q)\P, 
0,    klt    ...,  kt  J      \     0     otherwise. 


Proof.  Since  &t  =  ...==&,  =  £  (mod  ^p*1)  and  77^=  1  [K*^fl,  it  follows  by 
the  definition  of  the  ^-symbol  and  lemma  1  that 


™/<7>  Pr^>  ...»  Prilt\  =  F(q,  Pr\  =  (1     if  (#r><7)  I  A>, 
\0,    klt    ...,  ^  /         W,  /o  /      \0    otherwise. 

2        (-i)vi+.»-*%SBS-iJ 
Also,  ?l  .....  ^.0.1        ; 

7i+...+^>0 

and  the  lemma  is  therefore  proved. 

LEMMA  6.  Let  r,  qt  s,clt  ...,  c8  be  given  integers  ;r^2,  q^l,8^l9  and 


o      o    /,.          ^x  v       /^K)...XnJ     /g,  wj,  ...,  n;\ 

O    =  Or  nlC^  *  •••  •  C.I   =  X,  -7  -  z  --  ^   ^1  ^  I  ' 

r,flv  i>         «/     nit,^ni>1{q,nL...,n$     \0,  cx,   ...,    c,/ 

TO     i/    I>(^^|c1,...,ca)  =  (^^)-V  [p]. 

>  0  otherwise. 


"1  IT  (  1  ~  ^^  D(^r,  g  |  Cj,  .  .  .  ,  c8)\ 
Prtal          ^  ; 

Proof.   Suppose  that 


(ni9n'j)=l     (KiJ^s).  (3) 

Then        q~l{qtnlt  ....njxg-1^,^  --.wi)  =  ^fe^i^i*  .--^Xl,  (4) 


\0,  cj,    ...,  c8/     \0,   Cl,    ...,  cj          \0,        cl5   ...,     c8 

To  prove  (4)  consider  any  prime  p,  and  let^>A,  p"<,  ^"i  be  the  highest  powers 
of  p  which  divide  q,  nif  n't  respectively  (1  ^i  ^  s).  It  is  clearly  sufficient  to 
show  that 


max(A,  j^,  ...,  j>s)  +  max(A,  v(,  ...,  vs)  —  2A  =  max(A,  vl  +  v[,  ...,^  +  vi)  —  A. 

But  since,  by  (3),  ^  =  ...  =  vs  ==  0  or  ^  =  ...  =  v'8  =  0,  this  is  evidently 
true,  and  so  (4)  is  established. 

To  prove  (5)  we  note  that,  if  at  least  one  factor  on  the  left  vanishes,  then 
the  right-hand  side  vanishes  also.  If,  however,  both  factors  on  the  left  are 
equal  to  1,  then,  by  lemma  1, 

K»  nj  |  (c«  -  c,)  ;  K,  q)\ 
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Therefore,  by  (3), 


njUj)  |  (c*-^), 
and  so,  by  lemma  1,  the  right-hand  side  of  (5)  is  also  equal  to  1.  Thus 


where  /(%,  ...,ns)  is  multiplicative  in  %,...,  ws.   Furthermore,  the  series 
defining  qS  converges  absolutely,  for  it  is  majorized  by 


Hence,  by  lemma  4, 


where 


^-,  ......  s.^t-^^ 

*t+...+«y,>o 
To  evaluate  ^p  we  first  observe  that,  if 

1^*O,     l^vl<...<vi^s,    dVi  =  Sv^=  ...  =  8Vi=  1     and    cVi,...,cH 
do  not  all  belong  to  the  same  residue  class  (modpr),  then 

Jq,P'\...,prS*\      Q< 
\0,   clf    ...,   ca  ) 

Now  for  l^p^pr,  let  £p  denote  the  number  of  numbers  amongst  c1?  .  .  .  ,  c8 
which  are  congruent  to  p  (mod^>r).  If  tp>Q,  denote  these  numbers  by 
tff\  —,  We  then  have 


=    S          S       ( 
i<p<2Jr*,,....*<p-ofi 

tp>0      i/i+...+7»p>0 

=  —     2     1     (by  lemma  5) 


Hence,  by  (6)  and  (7), 
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The  first  assertion  is  now  evident.  To  prove  the  second  we  suppose  that 

D(p',q\  cl9  ...,ca)  <  (pr,q)~lpr     (p). 
Then  D(^5g|Cl,...,cg)  =  0     if    f\q. 

We  therefore  have 


Furthermore,  the  value  of  the  infinite  product  is  positive,  since  each  factor 
is  positive  and  since  the  factor  corresponding  to  p  is  at  least  1  —  sp~~r  for  all 
sufficiently  large  values  of  p. 

The  general  pattern 
THEOREM  1.  Let  r}  q,  s,  cl3  .  .  .,  cs  be  given  integers;  r  ^  2;  q,  s,  clt  .  .  .,  cs  ^  1. 

(i)  IfD(p*,q\  clf  ...,ca)  =  (pr,q)~lpr  [p],  then  MJ(x\  q\  clf  ...,cs)  =  0. 
(ii)  Otherwise* 

Mr(x;  q>  clt  ...,ca)  =  8,.^,  ... 

((  nr  o 
l-^ 
-      .,  P 

Proof,  (i)  Suppose  that,  for  some  p, 


This  means  that  for  every  k  in  the  range 


we  have  k(  pr,  q)  =  ct  (mod#r)     [1  ^  i  ^  s].  (8) 

Next  we  write  v  =  q/(pr,  q).  We  then  have,  for  every  n  ^  1, 


Hence  pr  \  nq  +  k(pr,  q)     [l^k^  (pr,  q)-lpr  J, 

and  so,  by  (8),  pr\nq  +  ci     [l^i^s]. 

Therefore  J^,(aj;  q;  clt  ...,cs)  =  0. 

(ii)  Now  suppose  that 


For  the  remainder  of  this  proof  it  will  be  understood  that  summation  indices 
range  over  positive  integers. 

*  The  method  used  in  the  proof  below  does,  in  fact,  show  that  for  8  —  1  the  error 
term  is  O(xst(r+'-V)  =  O(xltr). 
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Let  0  <  a  <  r*1.  Using  the  relation 

Mn}  =  S  /*(<>)> 

cMn 

we  have 


Si  +  S2,        (9) 


where  in  ^  the  outer  summation  extends  over  nlt  ...,7is<a;a,  whilst  S2 
consists  of  the  remaining  terms. 
By  lemma  3, 


^    fa?""> 

W>  ^i>  •••>  ^ 


::         S       {»!,...,  wS 

ft—  1       w*>a^ 


by  lemma  6.  But 


S      M,...^;}-1^   S   ^-r       S        1^   S   (<*(»))•»-•  =  0(^-^-D+«), 

«*>«"  n>x*        {nlf  ...,n,}=>n         n>^" 

ni,...,n,>l 

since  d(n)  =  0(^e).  Hence 

Si  -^Jljl-^—^^^  (10) 


where    /*=         S       1=        S  H    f 

n\rt\-nq~c\  w*r<*-7ig=c*  l<A<s\nAr< 


na^x  n*>xot 

n»>x°( 

s      n 


so  that  S2  =  0(a;1~a(r-1>+€).  (11) 
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Putting  a  =  l/(r  +  5-  1),  we  have,  by  (9),  (10)  and  (11), 

Hfc;  g;  clf...,ca)  =  xq-ln  (l-(-^^-D(pr,q\clLj  ...,c\  +  O(xW+*-*>»). 

The  proof  is  completed  by  noting  that  the  value  of  the  infinite  product 
appearing  in  this  formula  is  positive  by  lemma  6. 

THEOREM  2.  (i)  If  D(pr  \  alt ...,«,)  =  pr    [p],  then  H(x)  =  0. 
(ii)  Otherwise,*         jj^ 


...  P 

and  h>0. 

Proof,  (i)  Suppose  that 

D(p'\ali...,al)=p>-     [p]. 
Let  n^l.  Then 


jpr    [jp], 

and  so  pr\n  +  ai     [l^i^l]. 

Hence  ^T(«)  =  0. 

(ii)  Next  suppose  that 

D(pr\alt...,al)<pr     (p). 
Write  t  —  max  (alt  ...,  o^,  &!,  ...,  6m).  Putting  g  =  1  hi  theorem  1,  we  obtain 

J^(x',cl,...,c8)  =  xu[l-D(pr\C»'''>C')}  +  0(^^  (12) 

P  \  P  ) 

We  have 

H(x)  -      S 


),     by  (12), 


*  If  /  =  1,  m  =  0  or  I  —  0,  m  =  1,  then  it  is  seen  immediately  from.  (1)  that  the 
error  term  is  O(xllr). 

f  When  Z  =  0  the  term  corresponding  to  k  =  0,  in  the  sum  expressing  h,  is  inter- 
preted as  1. 
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It  remains  to  prove  that  h  >  0,  and  this  will  clearly  follow  from 
H(x)>Ax     (X>XQ), 

where  A,  x0  are  some  positive  numbers  independent  ofx. 

Assume,  without  loss  of  generality,  that  £>0,  m>0.  Let  plt  ...,pm  be 
any  m  distinct  primes  exceeding  tllr,  and  write  q  =  p±  .  .  .  prm.  Let  n0  be  the 
least  positive  solution  (known  to  exist  by  lemma  1  )  of  the  system  of  con- 
greencesinn, 


(13) 
Then  every  positive  solution  of  this  system  is  given  by 

n  =  nQ  +  qv,     i>  =  0,  1,2,  .... 
Therefore,  by  (13)  and  our  choice  of  pk's, 

Pktno  +  di     (l^k^m,  l^i^l).  (14) 

If  p  —  pk  [1  ^  k  ^  m],  then 

D(pr,q\nQ  +  a^...inQ  +  al)  =  D(pr,pr  \  n0  +  al9  ...,^0  +  az)  =  0     by  (14). 
If  p3=pk  (1  ^k^m),  then 

D(pr,q\n0  +  a1,...,n0  +  al)  =  D(pr  \n0  +  alf  ...,nQ  +  a^) 

—  D(pr  |  alt  .  .  .  ,  Orj)  <  pr     by  hypothesis. 
Thus,  in  any  case, 

We  have 


>Ax     (x>x0), 
by  theorem  1  and  (16).  This  completes  the  proof. 

Two  special  cases 

Let  C  be  a  specified  class  of  integers.  By  a  block  ofs  integers  with  respect 
to  C  we  shall  understand  a  sequence  of  s  consecutive  positive  integers,  say 
nt  n+I,  ...,  n  +  s—  1,  which  are  in  (7,  whilst  n—  1  and  n  +  s  are  not  in  G. 
By  an  easy  application  of  theorem  2  we  are  able  to  investigate  the  frequency 
of  blocks  of  r-free  numbers  and  of  r-numbers. 
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Let  QTt  8(x)  denote  the  number  of  blocks  of  s  r-free  numbers  not  exceeding 
x,  and  Vr>a(x)  the  number  of  blocks  of  s  r-numbers  not  exceeding  x. 

THEOREM  3.  (i)  For  r^2,s^2r,  Qr>8(x)  =  0. 
(ii)  Forr^Z, 


where 


and  #r,g>0. 

Proof,  (i)  r^  2,  s^2r  imply  Cr,s(^)  =  °>  since  of  2r  consecutive  integers 
one  must  be  divisible  by  2r. 

(ii)  Suppose  that  r  ^  2,  1  <  5  ^  2r  —  1  ,  so  that 

D(pr\2,3,...98+l)^8<2r^p     (p). 
Suice  Qrt8(x)  =*  H^x;  2,3,...,*+!;  l, 

it  follows  by  theorem  2  that 


where  ^r,s  >  0  and 

.8....,<+1)| 

P'       i  l} 

I  - 


THEOREM  4.  J^or  r  >  2,  5  ^  1, 


n 


P 

and  vr>8>0. 


508  L.  MlESKY 


Proof.          Vrt8(x)  =  Hr(x\  l,«  +  2;  2,3,  ...,a 

=  vr>  8x  +  0(a^«fr4*f  w+e)    by  theorem  2. 

Here  vr  a>0,  and 


S(-D*     s        n    i--  n 

*-0  Kv^r^Ka  P 


COROLLARY.   For  r^2,  I  ^  s  ^  2r  -  2, 


2'--2  /2r  —  2 

Furthermore,        vft^  =  (l-2-E(- 


These  results  follow  almost  immediately  from  theorem  4. 

Added  in  proof  (2  September  1948).  Since  writing  this  paper  I  have 
pursued  the  subject  a  little  further,  and  have  been  able  to  reduce  the  order 
of  the  error  terms  in  all  the  asymptotic  formulae  given  above.  For  a  brief 
exposition  of  the  improved  method  see  my  paper  "Note  on  an  asymptotic 
formula  connected  with  r-free  integers",  Quart.  J.  of  Math.  (Oxford),  18 
(1947),  178-182. 
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Introduction 

1.  Let  S  be  an  abstract  set.  The  elements  of  S  will  be  called  points.  8  is 
supposed  to  be  ordered  by  means  of  a  relation  a  <  b.  This  means  that,  given 
any  points  a  and  6,  exactly  one  of  the  three  relations 

a<b,     a  —  b,     b<a 

holds,  and  the  relations  a<  b,     b<c 

imply  a  <  e. 

A  subset  I  of  S  is  called  an  interval  of  S  if  it  satisfies  the  following  condition. 
Wheneverf 


, 

a, 

then  c  ^  I. 

There  need  not  exist  any  points  which  play  the  roles  of  end-points  of  /, 
even  in  the  case  of  "bounded"  intervals.  For  let  S  be  the  set  of  all  rational 
numbers,  ordered  according  to  magnitude,  and  let  /  consist  of  all  rational 
numbers  x  satisfying  2  <  a;3  <  3. 

This  fact  introduces  a  certain  amount  of  complication  into  our  arguments. 
Let  N  be  an  abstract  set.  The  elements  of  N  will  be  called  indices.  Suppose 
that  with  every  index  v  is  associated  an  interval  Iv  of  S.  The  possibility  is 
not  excluded  that  there  are  pairs  of  distinct  indices  v,  v'  satisfying 

•4  =  £• 

In  theorem  1  of  this  note  the  following  problem  is  solved.  Under  what 
circumstances  is  it  possible  to  divide  the  system  of  intervals  /„,  i.e.  the  index 

t  A  relation  A>*(B  denotes  the  fact  that  A  is  an  element  of  the  set  B;  B'C.B 
denotes  the  fact  that  B'  is  a  subset  of  the  set  B  ;  B'  G  B  is  equivalent  to  saying  that 
B'  C  B  and  B'  *  B.  The  relations  A  •*£  B,  B'  (£  B  are  the  negations  of  A  <(  B,  B'  C  B 
respectively.  O  denotes  the  number  zero  as  well  as  the  empty  set. 
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set  N,  into  a  given  number  k  of  parts  in  such  a  way  that  no  two  intervals  !„ 
belonging  to  the  same  part  have  a  point  in  common  ?  An  obviously  necessary 
condition  is  that  no  k  +  1  intervals  Iv  belonging  to  k  +  1  distinct  indices  v 
should  have  a  point  in  common.  Theorem  1  asserts  that  this  condition  is 
also  sufficient.  It  will  be  shown  that  this  result  is  equivalent  to  the  following 
proposition: 

If  any  system  of  k  +  1  distinct  indices  vlt  v^  ...,  vk+1  can  be  subdivided 
into  k  groups  in  such  a  way  that  no  two  intervals  /„  belonging  to  indices 
of  the  same  group  have  a  point  in  common,  then  the  same  kind  of  subdivision 
is  possible  for  the  whole  system  N  of  indices. 

Theorem  2  deals  with  the  following  problem.  Under  what  circumstances 
is  it  possible  to  find  a  finite  number  of  subsystems  of  a  given  system  of 
intervals  Iv  which  have  the  property  that  no  two  intervals  belonging  to  the 
same  subsystem  have  a  point  in  common  and,  at  the  same  time,  the  set  of 
points  covered  by  the  intervals  of  the  subsystems  coincides  with  the  set 
of  points  covered  by  all  intervals  of  the  given  system  lv  ?  A  necessary  and 
sufficient  condition  will  be  shown  to  be  the  following.  It  should  be  possible 
to  define  a  well-ordering  of  the  index  set  N  which  is  such  that  the  sot  N(x) 
of  all  indices  v  whose  corresponding  intervals  /„  contain  any  given  point  x 
possesses  a  last  element.  Furthermore,  it  will  be  shown  that,  if  any  finite 
number  k  of  subsystems  of  the  given  system  /„  can  be  found  which  possess 
the  above-mentioned  property,  then  this  is  always  possible  with  k  =  2. 
The  main  idea  of  the  proof  of  theorem  2  is  closely  connected  with  a  method 
used  by  Denjoyf  for  proving  a  more  special  covering  theorem. 

Theorems  3  and  4  are  covering  theorems  for  the  set  of  all  real  numbers. 
Theorem  3  is  an  extension  of  the  one  -dimensional  case  of  Lindelof  Js  covering 
theorem  .J  Theorems  4  and  5  are  quantitative  versions  of  theorem  2  in  which 
a  "small"  set  of  uncovered  points  is  admitted. 

Definitions  ;  statements  of  theorems  1  and  2 

2.  Sets  occurring  in  this  note  may  have  any  arbitrary  cardinal  number. 
Zermelo's  axiom  will  be  used  freely.  If 

A-L,  A2,  ...,  Ak 
are  sets,  then  the  symbols 

k 

.+AK    s  AK 


t  Reproduced  in  Hobson,  The  Theory  of  Functions  of  a  Real  Variable,  1  (2nd 
edition),  638. 

J  Carath^odory,  Vorlesungen  uber  reelle  Funktionen  (2nd  edition),  46. 
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denote  their  sum-set  and  the  symbols 


k 

H  AK 


denote  their  intersection.  A  relation 


expresses,  by  definition,  the  fact  that  the  objects 

al5a2,  ...,ar 

are  different  from  each  other.  Unless  the  contrary  is  stated,  any  set  is 
allowed  to  be  empty,  and  summation  indices  are  allowed  to  range  over  the 
empty  set,  in  which  case  the  value  of  the  sum  is  taken  to  be  0. 

THEOREM  1  .  Suppose  that  with  every  element  v  of  a  set  N  is  associated  an 
interval  Iv  of  an  ordered  set  S.  Then  it  is  possible,  for  a  given  positive  integer  kt 
to  find  k  sets  NK  satisfying 


if,  and  only  if,  the  following  condition  holds.  For  any  choice  of  k  +  1  distinct 
indices  VK  we  have  T   T        ^       _  o 

A>!-V2   •"  *Vkl-l    ~~        ' 

This  condition  is  equivalent  to  postulating  that  every  set  Nr  consisting  of  k+l 
indices  v  can  be  written  in  the  form 


where  IVIV.  =  0     (v^v'\  v,v'  *{N'K\  1 


THEOREM  2.  Suppose  that  with  every  element  v  of  a  set  N  is  associated  an 
interval  Iv  of  an  ordered  set  S.  Then  it  is  possible  to  find  a  positive  number  k 
and  to  determine  k  sets  NK  satisfying 


if,  and  only  if,  the  following  condition  holds.  The  set  N  can  be  normally  ordered^ 
in  such  a  way  that,  given  any  point  x  of  S  ^»  the  set  N(x)  of  those  indices  v 

v^N 

for  which  x  lies  in  Iv  possesses,  in  the  sense  of  this  ordering,  a  last  element. 
Moreover,  if  this  condition  holds  then  k  —  2  is  a  possible  value. 

t  I.e.  well-ordered  (wohlgeordnet),  see  Hobson,  loc.  cit.  211. 
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The  condition  in  theorem  2  is,  in  particular,  satisfied  if  every  point  occurs 
in  only  a  finite  number  of  intervals  Iv.  In  the  following  example  no  finite 
number  of  sets  NK  of  the  described  type  can  be  found  if 

£  =  ^  =  {1,2,3,...},         /,  =  {1,2,...,»}     (v^N). 

Lemmas 

3.  LEMMA  1.  Let  U  be  a  set,  V  be  a  normally  ordered  set,  with  ordering  rela- 
tion "  <  ".  Let  F(u)  be  a  function  which  associates  with  every  element  u  of  Van 
element  F(u)  of  V.  Then  U  can  be  normally  ordered  (ordering  relation  "  <s  ") 
in  such  a  way  that  F(u)<F(u'} 


Proof.  Define  any  normal  order  of  U  (ordering  relation  "->").  Define  an 
order  relation  "  <•  "  in  U  by  putting 

u<u' 

if,  and  only  if,  either  F(u)  <  F(u') 

or  F(u)  —  F(u'),     u->uf. 

Then,  obviously,  for  any  elements  u,  u'  of  U  exactly  one  of  the  three  relation  s 
u<u',     u~u',     u'<u 

is  satisfied.  Also,  the  relation  <?  is  transitive.  It  remains  to  show  that  "  <  " 
defines  a  normal  order.  Assume  that  there  is  a  sequence  u^  u2,  ...  of  elements 

of  U  such  that  ,  -.  x 

..  /  (1) 


Then 

Since  F(un)  -^  F,  and  since  "  <  "  defines  a  normal  order,  this  implies  that 

^K,)  =  JFKo+i)  =  ^ 

for  some  suitable  index  n0.  Then,  by  definition  of  "  <•  ", 


This  is  a  contradiction  against  the  fact  that  "->"  defines  a  normal  order. 
Hence  (1)  is  impossible,  and  the  lemma  is  proved. 

4.  For  lemmas  2-6  we  assume  that  S  is  some  fixed  ordered  set  (ordering 
relation  "  <  ").  Corresponding  to  any  interval  I  of  S  we  define  the  set  /+ 
by  putting  J+=  s  {x} 

x^y^I 

Clearly  /+  is  an  interval. 

t  The  symbol  {A,  Bt  ...,  K}  denotes  the  set  which  has  as  elements  the  objects 
A,  Bt  ...,K. 
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LEMMA  2.  Given  any  intervals  I,  J,  at  least  one  of  the  relations 

1+  C  J+,     J+  C  /+  (2) 

holds. 

Proof.   If  both  relations  (2)  were  false  then  we  could  find  points  a,  b 
such  that 


a-</+-/+J+,     6-<J+-/+J+.  (3) 

Then,  by  definition  of  /+, 


where  a',  b'  are  suitable  points.  Then  a  ^  6  would  lead  to 


which  contradicts  (3).  Similarly,  b  ^a  is  impossible. 

5.   LEMMA  3.  Let  r  >  0, 

7^4=0,     /+C/+ 
Then  IJz. 

Proof.  Choose  points  ip  such  that 


Then,  for  1  ,  .      .  T  r  T,  r  T+ 

~Ir    ^-  La  • 


Hence  *,,  <*,„.-< 

There  is  an  index  p0  satisfying  1  ^  pQ  ^  r, 


Then  iv  ^  ipo  <  ipo  (r,     v,  ^o  ^X  4, 

and  therefore,  by  definition  of  intervals  , 


6.  For  lemmas  4—6  we  assume  that  there  is  a  set  N  and,  associated  with 
every  element  v  of  N,  an  interval  Iv.  The  letters  a,  ft,  y,  v  denote  typical 
"indices",  i.e.  elements  of  N.  We  shall  assume,  without  loss  of  generality, 


This  is  justified  by  the  consideration  that,  for  the  purpose  of  proving 
theorem  1,  it  is  immaterial  to  which  of  the  sets  NK  any  v  belongs  for  which 
/„  =  0.  It  is,  however,  important  to  bear  in  mind  that  a=t=/?  does  not  imply 

4*V 

8KB.  2.     VOL.  50.     NO.  2411.  HH 
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A  relation  <xA/? 

expresses,  by  definition,  the  fact  that  either 


LEMMA  4.  Suppose  that  r  >  0, 


Put  IVp  =  Jp. 

Then  the  set  (1,  2,  .  .  .,  r  —  1}  can  be  written  in  the  form 


where  p,q>0,  =  r-l  (4) 


Proof.  By  renumbering  the  indices  v1?  ^2,  ...,  ^r  it  is  possible  to  obtain 
a  case  in  which,  for  some  r±  in  the  range  0  ^  rx  <  /*, 

J,*«£     (Kp^rJ,         ^  =  4     (r^p^r).  (7) 

The  hypothesis  implies  the  existence  of  points  ap,  bp,  cp  such  that 

).  (8) 


A  further  rearrangement  of  the  indices  vlt  ...,  vfi  leads  to  a  case  hi  which, 
in  addition  to  (7)  and  (8), 


ap<bp     ( 

Here  ra  is  a  number  in  the  range  0  ^  r2  ^  r1.  Then 

bp<cp<ap     (I^p^r2),  (9) 

ap<cp<bp     (r2<p^r1).  (10) 

For  if  (9)  does  not  hold  then,  for  some  /o^r2,  either  (i)  cp^bp^ap  or 


(ii)  bp^ap^cp.  By  (8)  and  by  definition  of  intervals  (i)  implies  bp^Jr  and 
(ii)  implies  ap  -^  JJ,.  Hence  in  either  case  a  contradiction  follows.  Similarly 
(10)  is  deduced.  We  have,  furthermore, 

bp<ca     (l<p,cp^ra),  (11) 

t  The  cases  p  =  0  or  q  =  0  are  to  be  interpreted  in  the  obvious  way.  The  same  holds 
throughout  this  paper. 
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since  otherwise,  for  some  pair  p,  er  satisfying  1  <  p,  cr  <  r2,  we  would  have 
c<r^bp<  CP  an(i  therefore,  by  (8),  bp-^Jr,  which  is  false.  Similarly, 

(12) 


since  otherwise,  for  some  p,  cr  satisfying  r%<p,  cr^r1}  we  would  have 
ctr<ba.^cp)  b^-^J,.,  which  is  a  contradiction. 

We  now  show  that  the  assertion  of  the  lemma  holds  when 

K,*a»  ••->**>}  =  (1>2>  ->r2},     (Ma>  .-•>*<,}  =  K+  1,  ».,r-  1}. 
If,  first  of  all,  rz  >  0  then  there  exists  p0  such  that 


Then,  by  (11), 

bp<cp^cp)         bp,cp^Jp,         op^Jp     (l^p^ra),         c,0-<«k 

^...J^O.  (13) 

This  inequality  also  holds  if  rz  =  0. 

Similarly,  if  rz  <  r^  then  there  is  p^  such  that  r2  <  p^  <  r1?  cp  ^  cpl  (r2  <  p  <  rx). 
Then,  by  (12), 

CP  ^  CPI  <  bp>         CPI  ^Jp     (r2  <  P  <  ^i)  , 

e/r2+1...Jr/ri+1...Jr  +  0.  (14) 

This  inequality  also  holds  if  r2  =  r±  .  Relations  (13)  and  (  1  4)  prove  the  lemma. 

7.    Define  a  relation  ,,  ,,  ^v 

a~p  (15) 

as  follows.  (15)  holds  if,  and  only  if,  there  exists  an  integer  r  ^  0  and  indices 
j>0,  !>!,...,  vr  satisfying 

v0  =  a,          yr  =  ft,         vp-iAvp     (1  ^p^r). 

In  particular,  a  ~  a  for  every  a.  The  relation  (15)  is  symmetrical  and  transi- 
tive and  therefore  defines  a  distribution  of  all  elements  of  N  into  non- 
overlapping  classes,  called  A-classes  or  simply  classes.  The  class  to  which  a0 
belongs  is  denoted  by  K(a0),  and  any  typical  non-empty  classes  are  denoted 
by  K,  K'.  Thus,  for  every  K, 


The  relation  a  *  ft 

is  meant  to  express  the  fact  that  (15)  does  not  hold. 

8.   LEMMAS.    Suppose  that  0 

** 


.,_. 
(16) 


HH2 
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Then  either  J«GJ>0     (a~ao)  (18) 

or,  if  (18)  is  false,  ^^     (^  ^  (19) 

Proof.  Relations  (18)  and  (19)  cannot  hold  simultaneously.  For  if  one 
applies  (18)  to  a  =  a0  and  (19)  to  /?  =  /?0,  then  the  contradiction 

^G/^G/,, 

follows.  We  now  show  that  at  least  one  of  the  relations  (18),  (19)  is  true. 
By  (16)  we  have  Iao^=I^.  Therefore  it  is  no  restriction  to  assume  that 

^4..  (20) 

Then  J,.G/,0.  (21) 

For  otherwise  we  would  have         r    /t  r 

4*o  ^-Vo' 

which,  together  with  (20)  and  (17),  would  imply  a0A/?0  and  hence  a0~/?0 
which  contradicts  (16).  We  shall  now  establish  (18).  Let  us  assume  that 
there  exists  an  index  a  satisfying 

«-«„,     /.([I,..  (22) 

Our  aim  is  to  deduce  a  contradiction. 

One  can  find  a  number  r  ^  0  and  indices  ap  satisfying 


We  deduce  from  (21)  and  (22)  that  a0=|=a.  Therefore  r>0.  Let  p0  be  the 
largest  number  p  satisfying 


The  existence  of  p0  follows  from  (21)  and  (22).  In  fact,  0  ^p0  <  r.  Then 


If 

then  it  follows  that  A 

°WiA 

which  contradicts  (16).  Hence 


But  this  is  impossible  in  view  of  the  fact  that  ap(>Aapo+1.  We  have  obtained 
the  desired  contradiction.  Our  argument  proves  that 
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The  proof  of  (18)  is  completed  by  noting  that  the  relations 

<%'~a0,     I*  =  Ifr 
would  imply  that  a'A/?0,  and  hence  a0  ~  a!  ~  /?0,  which  contradicts  (16)* 

9.   Let  N'  be  a  subset  of  N  which  has  the  following  property.  There 
exists  a  constant  C  such  that  the  relations 


imply  m^C.  Then  we  denote  by 

the  smallest  number  (7  of  this  kind.  For  all  other  sets  N'  C  N  we  put 

R(N')  =  oo. 

Thus  JK(0)  =  0. 

Clearly  #(#  '  +  N")<  R(Nf)  4-  £(#").  (23) 

The  main  part  of  theorem  1  can  be  expressed  as  follows.  If 

X(N)^k,  (24) 

then  there  are  k  sets  NK  such  that 


10.   LEMMA  6.  //  (24)  holds  then  every  class  K  contains  at  most  enumerable 
many  indices  v. 

Proof.  Let  v*  be  any  fixed  element  of  K.  Let  r  >  0, 


Apply  lemma  4,  with  vr  =  v*.  We  see  from  (5)  and  (24)  that,  in  the  notations 
of  lemma  4, 


nt^r\     i 
R(N)  ^  k, 

and  from  (6)  and  (24)  that 


Therefore,  by  adding  and  using  (4), 


Hence  there  are  only  a  finite  number  of  different  indices  v  satisfying  R/Yi>*. 
Therefore,  for  any  given  number  s,  the  number  of  different  systems 
a0,  a±,  .  .  .,  aa,  such  that 

v*  =  a0,         a^Aa,     (l^cr^«)  (25) 

holds,  is  finite.  If  now  s  and  the  aff  are  allowed  to  vary,  subject  to  their 
respective  restrictions,  then  ag  can  be  made  to  coincide  with  any  element  a 
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L1.  Let  50(a)  denote  the  smallest  value  of  s  corresponding  to  which  5+1 
indices  aff  can  be  found  satisfying  as  —  a  and  (25).  Then  for  any  number 
t  there  are  only  a  finite  number  of  elements  a  of  K  satisfying  s0(a)  =  t. 
Therefore  all  elements  of  K  can  be  arranged  in  a  sequence 

aW,  a<2>,  a<3>,  ... 
such  that  s0(a<1))  <  5o(a(2))  <  •  — 

Proof  of  theorem  1 
1  1  .   We  begin  by  proving  the  trivial  part  of  theorem  1  .  Let  us  suppose  that 


Then,  by  (23),  R(N)  ^  R(NJ  +  .  .  .  +  R(Nk)  ^  k. 

Hence  IVlI,z  ...  IVJc+1  =  0, 

whenever  (  v±  ,  .  .  .  ,  vk+^  )  +  . 

12.   Let  N'  be  a  set  consisting  of  k  +  I  indices  v.  Our  next  aim  is  to  show 
the  equivalence  of  the  following  two  statements: 

(A)  There  exist  k  sets  N'K  satisfying 

(26) 
(27) 

(B>  n  /„  =  o. 

v^N' 

Let  X'  =  fa,...,vt»j. 

Then  (vlt  ...,  VM)+. 

First  of  all,  suppose  that  (A)  holds.  Then  there  are  two  of  the  k+  I  indices 
VK  which  lie  in  the  same  set  N'K,,  say 


where  1  ^  Kr  ^  k 

Then,  by  (27),  n  /„  C  /,A/^  =  0, 

which  proves  (B). 

Next  suppose  that  (A)  is  false.   By  lemma  2,  we  may  assume  that  our 
notation  is  such  that 

/+     C/+C...C/+. 

If  V*«+° 

then  we  conclude,  by  means  of  lemma  3,  that 


1946]  COVERING  THEOREMS  FOR  ORDERED  SETS  619 

i.e.  that  (B)  is  false.  If,  on  the  other  hand, 

^.«  =  ° 

for  some  index  K'  in  the  range  1  ^  AC'  ^  k  +  1,  then  /c'  ^  k.  Call  the  k  sets 


in  any  order  N[,  N2t  .  .  .  ,  N'k. 

Then  (26)  and  (27)  hold.  But  this  is  a  contradiction  since  we  assumed  (A) 
to  be  false. 

We  have  proved  the  equivalence  of  (A)  and  (B),  and  we  now  come  to  the 
proof  of  the  main  part  of  theorem  1. 

13.    Case  1.  Suppose  that  N  is  a  finite  set.  Then  we  may  assume  that 


We  are  given  that 

In  view  of  lemma  2  we  can  choose  our  notation  in  such  a  way  that 


We  define  a  function  f(v)  (1  ^  v  ^  n)  as  follows.  Let  vQ~^N.  Suppose  f  that 
f(v)  has  already  been  defined  for  all  V<VQ,  and  furthermore,  that  this 
function  has  the  property  that 


iv    /(a)  = 
Then  Iet/(y0),  by  definition,  be  the  smallest  integer  K  such  that 


f(v0)  exists  because  &+  1  is  a  number  K,  not  necessarily  the  smallest,  of  the 
required  type.  Put  /(*>„)  =  kQ.  Then  the  definition  of  k0  implies  the  existence 
of  kQ  —  1  indices  VK  such  that 


Also 

Therefore,  by  lemma  3, 


(28) 


f  If  VQ  =  1  then  no  assumption  is  made. 
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Furthermore,  (/W'/O'i)'  —  '/OV-i))*' 

and  hence  (i>0,  ^,  ...,  i^)*.  (29) 

We  deduce  from  (28)  and  (29)  that 


Thus  the  definition  of  the  function  f(v)  has  been  extended  to  the  range 


4^  =  0     (a  </?<"„.    /(*)-/(/»))• 

In  this  way  the  inductive  definition  off(v)  for  all  v  of  N  has  been  accom- 
plished. The  resulting  function  satisfies  the  conditions 

,2  .....  *}     (v 


Let  j^  be  the  set  of  all  y  such  that 

/(")  =  *-  (30) 

Then  ^  +  ^2  +  ...+^  =  ^,  (31) 

X(NK)^1     (l^K^k).  (32) 

Theorem  1  is  proved  for  finite  sets  N. 

14.    Case  2.  Suppose  that  N  is  enumerable.  We  may  assume  that 

tf  =  {1,2.3,...}. 

By  §  13  there  exists,  for  every  n^N,  a  f  unction  fn(v),  defined  for  v^n  and 
satisfying  fn(v)^{l,Z  .....  k}     (v<n), 

4/^  =  0     (ct<ft^n,    fn(a.)=fn(fl».  (33) 

By  means  of  Cantor's  diagonal  process  one  can  find  a  sequence  nr  ->  oo  which 
has  the  property  that  the  limit 

Urn  fn.(v)  =f(v), 

n'—  >ao 

say,  exists  for  every  v.  Then  there  are  numbers  mv  such  that 


Let  NK  be  the  set  of  all  v  satisfying  (30).  Then  (31)  holds.  If 
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then,  for  any  ri  ^  min  (raa,  m^), 


and  therefore,  by  (33),  Ialft  =  0. 

This  proves  (32). 

15.    Case  3.  Suppose  that  N  is  an  arbitrary  set.  If  K,  K'  are  any  A-classes 

then  we  introduce  the  symbol 

'  (34) 


as  an  expression  of  the  fact  that  there  exists  an  index  a'  of  K'  satisfying 


At  present  we  shall  not  require  any  general  properties  of  the  relation  (34). 
Let  r  be  a  positive  integer,  and  suppose  that  the  classes  KQt  Klt  ...,  Kr 

satisfy 

(35) 


Then  there  are  indices  ax,  a2,  ...,  ar  satisfying 


Since  KQ  is  not  empty  we  can  choose  an  element  a0  of  K0.  Then,  by  definition 

°f(34)'  4.cv^."<^ 

Hence  (a0,a1}  ...,ar)+,     /ao/ai42  .../«,  =  -40*0, 

and  thus  r  +  1  <  R(N)  <  AJ. 

It  follows  that,  given  any  class  KQ,  there  exists  a  largest  number  r,  denoted, 
say,  by 


which  has  the  property  that  (35)  is  satisfied  for  suitable  classes  Klt  ...,KT. 
Then 


If  for  a  given  class  K  there  is  no  class  K'  satisfying  (34)  then  we  put  F(K)  —  0. 
Apply  lemma  1  to  the  sets 

E7  =  :£{#},     F  =  {0,  1,.  ..,£-!}. 

We  obtain  the  existence  of  a  normal  order  of  U  (ordering  relation  "<") 
which  has  the  property  that 

(36) 
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I  shall  now  define,  by  means  of  transfinite  induction,  a  certain  function 
f(v).  Let  KQ  be  any  fixed  class,  and  assumef  that 

(i)  f(v)  has  already  been  defined  for  every  index  v  satisfying,  for  some 
suitable  class  K  ,  the  relation 


(ii) 

(iii)  Ialf  =  0 

whenever  a-^K'<K0,     fi^K"<KQ,     a*/3,    /(a)  =/(/?). 

I  shall  then  extend  the  range  of  definition  of  f(v)  by  defining  f(v)  for  every 
v  of  KQ,  and  the  extended  function  will  satisfy 

(iia)  /(*K  {1,2,  ...,*}     (v^K^KJi 

(ma)  IJP  =  0 

whenever  a^K'*>K0,     fl^K*^K0,    .a*/?,    /(<*)=  f(fi). 

This  definition  of  a  function  /(v)  by  transfinite  induction  proceeds  by  means 
of  an  extension  of  the  region  of  definition  not  by  an  individual  new  index  i>0, 
as  is  commonly  the  case,  but  by  all  indices  v  belonging  to  some  non-empty 
class  KQ.  No  two  classes  K  overlap,  and  the  set  of  all  these  classes  has  been 
normally  ordered.  This  procedure  constitutes  a  valid  definition  of  a  function. 
The  function  f(v)  thus  obtained  will  be  defined  for  all  v  and  will  satisfy 

jf(vH{l,2,...,&}     (v-^N),         /a/^  =  0     (a*#  /(a)  =/(/?)). 
Hence  the  sets  NK,  again  defined  as  consisting  of  all  indices  satisfying  (30), 
will  have  the  properties  (31)  and  (32),  and  the  proof  of  theorem  1  will  be 
completed.  ' 

We  now  proceed  to  describe  the  extension  of  a  function  f(v)  which  is 
supposed  to  have  properties  (i),  (ii)  and  (iii)  above.  Let  a0  be  any  fixed 
element  of  KQ.  Suppose  that  an  index  vx  and  a  class  K1  are  such  that 

VI<KI<K»  (37) 

4oe^.  (38) 

Then  a0H!#i»     «o""'i»     *«  A  =  J«0  *  °- 

Hence,  by  lemma  5,  either 

J.GJ^     (a  -<#0),  (39) 

or  /„£/„„     (v-^i). 

The  last  relation  is  false,  in  view  of  (38). 

j-  If,  in  the  normal  order  of  £7,  K0  is  the  first  element  then  no  assumption  is  made. 
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Let  N™  be  the  set  of  all  indices  v^  satisfying  (38)  corresponding  to  which 
there  exists  a  class  K±  satisfying  (37).  Then,  by  (39), 

The  set  N&  may  be  empty.   I  assert  that  Nw  is  a  finite  set  containing,  say, 
nw  elements,  and  that 

In  order  to  see  this,  choose  any  positive  integer  mt  and  suppose  that  the 
indices  a1}  a2,  ...,  am  satisfy 

alf  02,  ...,  a^tfo,  (42) 

(al3a2,...,am)*,  (43) 

Jai/a2.../am4=0.  (44) 

Then,  by  (42)  and  (40), 


Hence,  whenever  r  >  0  and 

,  (45) 


we  have  Jai/aa  .../^/^  .../,  =  /ai  ...  /am^=0.  (47) 

Also,  by  (45)  and  the  definition  of  jY(1),  there  are  classes  Kp  satisfying 


Hence  Vp^&n     (1  </>=^r,  1 

Thus  (al5  ...,am,  i/lf  ...,vr)+, 

and  therefore,  by  (47),  m  +  r  ^  ^(^)  ^  A;. 

This  holds  for  any  choice  of  any  number  of  indices  vp  satisfying  (45)  and  (46). 
Therefore  N(1)  is  a  finite  set  containing,  say,  exactly  r&(1)  elements,  and 


This  is  true  for  every  choice  of  any  number  of  indices  a^  satisfying  (42),  (43) 
and  (44).  Hence  (41). 

According  to  (37),  f(v)  is  defined  for  every  element  v  of  N^.  The  set 

-4=2    {/(»•)}  (48) 

V   <(  JV(1) 

contains  at  mostf  n(1)  elements.  Hence  the  set 


In  fact,  A  contains  exactly  n(1>  elements,  as  is  easily  seen  from  (38)  and  (iii), 
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contains  at  least  k  —  n(l\  and  therefore,  by  (41),  at  least  R(KQ)  elements.  Put 


By  lemma  6,  KQ  .contains  at  most  enumerably  many  elements.  Hence,  by 
case  1  or  case  2  of  the  present  proof,  there  exist  kQ  sets  N*®  satisfying 


(49) 
(50) 

Let  J5  =  {61,6a>...,6<}, 

where  (&i,&2»  •••»&/)*• 

Then  £  ^  &0.  We  extend  the  range  of  definition  of  the  function  f(v)  by  putting 
/(")  =  &»  ("X-tff,  !<*<*.).  (51) 

We  have  to  show  that  (iia)  and  (iiia)  are  satisfied.  It  is  obvious  that  (iia) 
holds.  In  order  to  prove  (iiia),  assume  that  there  are  indices  al9  ^  and  classes 
K1  ',  K"  such  that 

/ai/A=t=0.     (52) 


We  have  to  deduce  a  contradiction.  There  is  no  loss  of  generality  in  assuming 

that 


If,  in  addition,  K"  <•  K0, 

then  (52)  would  contradict  (iii).  Therefore 


If  K'  =  K"  =  K0,  (53) 

then,  by  (52)  and  (51),          /(ax)  -  /(&)  =  6^, 
where  /q  is  some  number  in  the  range  1  <  K^  ^  kQ.  Then 


and  therefore,  by  (50),  4^  =  °- 

This  is  a  contradiction  against  (52).  Hence  (53)  is  false,  and 

K'<K"  =  K<>.  (54) 

Then  ai^A>     4A*°- 

Therefore,  by  lemma  5,  either 

4G/A     (a^K')  (55) 

or  IfCIai     (ft^K*).  (56) 
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Case  A.  Suppose  that  (55)  holds.  Then,  by  definition  of  (34), 

K'  A  K". 
Put  F(K")  =  s.  Then,  by  definition  of  F(K"),  there  are  s  classes  K0  satisfying 


Then,  by  definition  of  F(K'), 
F( 
On  the  other  hand,  (54)  and  (36)  lead  to 


which  is  the  desired  contradiction. 

Case  B.  Suppose  that  (56)  holds.  Then  we  may  put,  hi  (56),  /?  =  a0  and 
obtain 


Also,  by  (54),  a^K'  <  K0. 

Therefore,  by  definition  of  N&,  a^N™.  Hence,  by  (48), 

JM<A.  (57) 

On  the  other  hand,  fa  -<  K"  =  K^ 

and  therefore,  by  (49)  and  (51), 

f(01)^B  =  {l,2,...,k}-A,    /(AH-4.  (58) 

We  deduce  from  (57)  and  (58)  that 


which  contradicts  (52).  This  completes  the  proof  of  theorem  1. 

Proof  of  theorem  2 

16.   As  in  theorem  1,  we  are  given  a  set  N  and,  corresponding  to  every 
element  v  ofN,  an  interval  Iv  of  the  ordered  set  S.  We  introduce  the  notation 

[N']=   S  /„     (N'CN). 

v^N' 

In  order  to  prove  one  part  of  theorem  2,  we  assume  the  existence  of  a  finite 
number  of  sets  Nlt  .  .  .  ,  Nk  satisfying 

(59) 
(60) 
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We  want  to  prove  that  N  can  be  normally  ordered  in  such  a  way  that,  given 
any  element  x  of  [N],  the  set 


contains,  in  the  sense  of  this  normal  order,  a  last  element. 
Define  any  normal  order  of  the  set 


and  any  normal  order  of  the  set 

N"  =  N-N', 

the  ordering  relation  in  both  cases  being  denoted  by  "<".  Extend  this 
ordering  relation  so  as  to  obtain  an  ordering  of  Nt  by  putting,  in  addition, 

a<b     (a^N"tb^Nf).  (61) 


It  is  well  known  that  this  process  leads  to  a  normal  ordering  of  the  set 

N  —  N'  +  N".  If  now 


then,  by  (60),  x*([NK]  (62) 

for  at  least  one  index  K  in  the  range  1  ^  K  ^  k.  Let  K19  K2,  .  .  .  ,  Km  be  all  distinct 
indices  AC  satisfying  (62).  Then  we  have  l^m^k.  Ifl^/^m  then  there 
exists,  by  (59),  exactly  one  index  vft  such  that 


Hence  the  set  N'N(x)  is  finite  and  not  empty.  It  possesses,  in  the  sense  of 
the  normal  order  defined  above,  a  last  element  v*.  Then  v*  is  the  last  element 
ofJV(a:).  For  let  v^N(x)  =  N'N(x)+N"N(x).  If  v-^N'N(x),  then,  by 
definition  of  v*9  v  ^  v*.  If  v  X  N"N(x)  C  N"  then,  by  (61),  v  <  v*.  Hence  the 
condition  stated  in  theorem  2  is  necessary. 

17.  In  this  section  we  assume  that  N  has  been  normally  ordered,  ordering 
relation  "  <  ",  in  such  a  way  that,  given  any  point  x  of  [N],  the  set  N(x) 
contains  a  last  element.  For  convenience  we  define  a  symbol 

t*M9 

where  t  is  either  the  number  0  or  the  number  1  and  M  is  any  set,  by  putting 

/the  empty  set  (t  —  0) 


I  define,  by  means  of  transfinite  induction,  a  function  f(v).  Let  y0-^  jY,  and 
suppose  that  for  every  v  <  v0  a  number  f(v)  has  already  been  defined  satis- 

fying 
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Then  put  f(v0)  =  0, 

2/,,  (63> 


and  put  /(v0)  =  1 

otherwise.  This  defines  /(v)  for  every  v,  and  we  have 


Put  N'  =  S  /(")*{*>}. 

v^N 

Then  IN']  =  [N].  (64) 

For  it  is  obvious  that  [IV7]  C  [N].  (65) 

Now  let  XQ  be  any  element  of  [N].  Then  there  exists,  by  hypothesis,  a  last 
element  v0  of  the  set  N(x0). 

Case  1.  Suppose  that/(i>0)  =  0.  Then  (63)  holds,  and  hence 


Then,  by  definition  of  v0, 

Therefore  »0X  S  /M*^,     *o^4 


for  some  ^  satisfying  v1  <  v0,    /(vj)  =  1. 

Then  v^N',     x^ 

Case  2.  Suppose  that/(p0)  =  1.  Then 


Since  x0  was  any  pouit  of  [N]  we  conclude  that 


This  result,  in  conjunction  with  (65),  proves  (64). 

The  proof  of  theorem  2  will  be  completed  when  I  have  shown  that 

R(Nf)  ^  2.  (66) 

For  then,  by  theorem  1,  there  are  sets  Nlt  N2  satisfying 


I  prove  (66)  indirectly.  Assume  the  existence  of  indices  a,  /?,  7  and  of  a  point 
.satisfying 
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We  shall  deduce  a  contradiction.  We  have 


4-4,  (67) 

v«x,  v>a 

say.  Then  IftCA.  (68) 

For  if  ft>a  then  IpC  2  IVCA, 

vxx, 

and  if  0<a  then  /,  =  f(fi)*IflCA. 

By  reasons  of  symmetry,  we  have 

lyCA.  (69) 

By  (67),  (68)  and  (69),  /«<£/,  +  /,,. 

One  can  find  a  point  #a  such  that 

x.4  ia  -4(7,  +/r). 

By  reasons  of  symmetry,  one  can  find  points  jfy,  xy  satisfying 

Xf^If-m  +  l,),     *r-<4-4(4  +  7,).  (70) 

Without  loss  of  generality,  we  may  assume  that 


If 

then  z  ^  Xp  ^  xy,     z,  xy  -^  Jy, 

and  therefore,  by  definition  of  intervals, 


This  contradicts  (70).  Hence          x^<z. 

Then,  similarly,  xa  ^  Xp  <  z,    xa,  z  -<  /a,     xft  -<  Ja. 

This  contradicts  (70),  and  theorem  2  is  proved. 

Special  covering  theorems  for  sets  of  real  numbers 

18.  In  this  last  section  S  denotes  the  set  of  all  real  numbers,  ordered 
according  to  their  magnitude.  The  letter  T  stands  for  subsets  of  S.  The 
letter  /  denotes,  as  before,  intervals  of  S.  When  S  is  the  set  of  real  numbers 
the  meaning  of  the  term  "interval",  as  defined  in  this  note,  coincides  with 
its  common  meaning.  It  denotes  a  set  of  numbers  x  defined  by  any  of  the 
following  four  relations  : 

a<x<b,    a<#<6,     a<#<&,     a^x^b. 
Here  a  and  b  are  either  real  numbers  or  any  of  the  symbols  ±  oo. 
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A  set  T  is  called  semi-open  if  x  <J  T  implies  the  existence  of  a  number 
y  =  y(x,  Z7)  4=  0  satisfying 


Every  open  set  of  real  numbers  is  semi-open.  The  sum  -set  of  semi-open  sets 
is  semi-open.  The  intersection  of  a  finite  number  of  semi-open  sets  is  semi- 
open.  Every  interval,  with  the  exception  of  those  which  consist  of  a  single 
point,  is  semi-open. 

A  quantitative  note  is  introduced  into  our  covering  problems  by  assuming 
that  a  measure  \  T  \  has  been  denned  which  possesses  certain  properties. 
Let  M  be  a  set  of  subsets  of  S.  The  set  M  plays  the  part  of  the  system  of  all 
"measurable"  sets  .  We  suppose  that  M  satisfies  the  following  condition: 

If  J,J'-4M 

then  J  +  J'^M,     JJ'^M,     J-JJ'^M.  (71) 

The  letter  J  will  always  denote  elements  of  M  .  With  every  J  is  associated 
a  non-negative  number  \J\,  the  "measure"  of  J,  which  satisfies  the 
condition 


=  \J\  +  \J'\     if     JJ'  =  0.  (72) 

This  implies  that,  for  any  Jl9  J2» 


and  hence  I  «M  ^  K  +  ^2  N  I  ^i  I  +  I  ^l-  (73) 

19.  THEOREM  3.  Suppose  that  with  every  element  v  of  a  set  N  is  associated 
a  semi-open  set  Tv  of  real  numbers.  Then  there  exists  an  at  most  enumerable 
subset  N'  of  N  satisfying  ^  T  =  ^  Tv.  (74) 

v^N'     V        f^N 

THEOREM  4.  Let  M  be  a  set  of  sets  of  real  numbers  satisfying  (71).  Let  \  J  \ 
be  a  non-negative  measure  defined  for  all  sets  of  M  and  satisfying  (72).  Suppose 
that  with  every  element  v  of  a  set  N  is  associated  an  interval  Iv  of  real  number  sy 
and  let  Iv  belong  to  M  and  contain  more  than  one  number.  Finally  ,  suppose 
that  there  is  a  constant  G  such  that 

|  /^  +/„,+  .  ..  +  /J«:<7  (75) 

for  every  positive  number  r  and  every  choice  of  r  elements  vp  of  N.  Then,  given 
e  >  0,  there  exists  a  finite  number  of  elements  ap  of  N  and  a  sequence  of  sets 
Jw  of  M  satisfying  r  m 


2),  (77) 

(78) 
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THEOREM  5.  Suppose  that  the  hypotheses  of  theorem  4  hold  and,  in  addition, 
that  the  set 


belongs  to  M.  Suppose  that  the  measure  \  J  \  satisfies  the  condition 


whenever  the  set  2  J\  belongs  to  M  and  the  series  2  |  J\  \  converges. 


(79) 


Then  there  are  a  finite  number  of  elements  ftlt . . . ,  flm  of  N  satisfying 

A     S/,  -e,  (80) 

4     v^N 


)•  (81) 

The  special  case  of  theorem  3  when  the  sets  Tv  are  open  coincides  with 
the  one -dimensional  case  of  Lindelof's  covering  theorem.f  Without  any 
restrictions  on  the  sets  Tv  theorem  3  is,  of  course,  not  valid,  as  is  shown  by 
the  example  in  which  N  is  the  set  of  all  real  numbers  *>,  and  Tv  =  {v}. 

Theorem  4  implies  that  there  exist,  under  the  hypotheses  stated,  two 
finite  systems  of  non-overlapping  intervals,  namely 


which  between  them  cover  "approximately"  the  same  set  as  the  whole 
system  /,,  (v-^N).  The  following  example  shows  that  (79)  does  not  follow 
from  the  hypotheses  of  theorem  3.  Take  as  M  the  set  of  all  sets  T.  Put 


if  there  are  numbers  a  and  6  such  that  a  <  0  <  6, 

x-^J     (a<x<b,  x=tQ). 
Put  |  J  |  =  i 

if  no  such  pair  of  numbers  a,  b  exists  but  if  there  is  a  number  c  4=  0  satisfying 


Put  |  J  |  =  0 

for  all  remaining  sets  J.  Then  (72)  holds.   Let  JA  be  the  closed  interval 
(A"1, 1)  (A  =  1,  2, ...).  Then  (79)  is  false,  since 


Loc.  cit. 
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Theorem  5  generalizes  a  well-known  lemma  in  the  theory  of  real  variables. 
In  (80)  the  constant  J  is  best  possible,  as  is  shown  by  the  following  example. 
The  system  M  consists  of  all  sets  of  real  numbers  which  are  Lebesgue- 
measurable,  and  |  J  \  is  the  Lebesgue-measure  of  J.  Let  0  <  t  <  1,  and  put 


Then,  for  any  finite  number  of  indices  /?,,  satisfying  (81), 


The  constant  £  in  (80)  cannot  be  improved  because  (1  +t)~l-+$  as  £->  1. 

20.  We  require  one  more  lemma,  which  may,  on  account  of  its  generality, 
possess  some  interest  of  its  own.  It  generalizes  a  known  theorem  on  sets 
of  inter  vals.f 

LEMMA  7.  Suppose  that  with  every  element  v  of  a  set  N  is  associated  an 
arbitrary  set  Tv  of  real  numbers.  Then  every  real  number  x,  except  those  con- 
tained in  an  at  most  enumerable  set  X,  satisfies  exactly  one  of  the  following 
conditions  : 

(i)  There  exists  an  element  v  such  that  x  is  an  interior  point  of  Tv. 
(ii)  Given  any  element  v,  there  exist  two  sequences  am,  bm  of  numbers  such  that 

am,bm^Tv,  (82) 

am<x<bm     (m=l,2,...),  (83) 

*fc 

lim  am  =  lim  6m  =  x.  (84) 

m—  >oo  m->oo 

Proof.  Let  #  be  a  number  which  satisfies  neither  (i)  nor  (ii).  Then  there 
exists  an  element  v  =  v(x)  such  that,  corresponding  to  these  x  and  v,  110 
sequences  am,  bm  can  be  found  satisfying  (82),  (83),  (84).  Then  one  can  find 
a  number  c(x)  =\=  0  such  that 

x  +  sc(x)«(Tv{x)     (0<*<1).  (85) 

Let  X+  be  the  set  of  those  numbers  x  of  X  for  which  c(x)  >  0,  and  put 
X_  =  X  —  X+.  Suppose  that  the  numbers  xlt  x%  satisfy 


Then  x±  +  c(xl)  ^  x2.  (86) 

For  otherwise  x1<xz<x1  +  c(xI). 

"f  Hobson,  loc.  cit.,  last  proposition  of  §73. 
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Then  one  could  find  numbers  slt  s2  such  that 


Then  we  deduce,  by  applying  (85)  to  x  ~  xlt  that  x%  is  an  interior  point  of 
TVl,  i.e.  that  x2  satisfies  (i).  This  contradicts  the  fact  that 


Hence  (86)  is  established.  Thus 

xi  <  xi  +  c(xi 
whenever  xl9  x2~^X+,     x1<xz. 

No  two  of  the  open  intervals  (x,x  +  c(x))  overlap  when  x  ranges  through 
-X+,  and  therefore  X+  is  at  most  enumerable.  By  reasons  of  symmetry  this 
groves  at  the  same  time  that  X_  is  at  most  enumerable.  The  lemma  is 
proved. 

Proofs  of  theorems  3,  4  and  5 

2  1  .  Suppose  that  the  hypotheses  of  theorem  3  hold.  Consider  any  bounded 
open  interval  /  which  has  rational  end-points  and  which,  moreover,  satisfies 
1CTV  for  at  least  one  element  v.  All  these  intervals  /  can  be  arranged  in 
a  sequence,  say  as  ,.(1)  ™} 

Choose  vp  such  that  I(p)^TVp     (p  =  1,  2,  ...). 

Let  X  be  the  at  most  enumerable  set  defined  in  lemma  7,  belonging  to  the 
semi-open  sets  Tv  of  theorem  3.  Then 


where  the  system  of  numbers  x^  is  either  finite  or  enumerably  infinite. 
Corresponding  to  every  number  x^,  choose  an  element  v'^  satisfying 


00=1,2,...). 
Then  (74)  holds  when 


For  assume  that  a  number  x  satisfies 

x^TVo  (87) 

for  some  VQ  of  N,  but  at  the  same  time 

(88) 


We  shall  deduce  a  contradiction.  (88)  implies  that  x  HtX.  Hence,  by  lemma  7, 
either  (i)  x  is  interior  point  of  some  set  Tv,  and  therefore 

(89) 
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for  some  p  >  0,  or  (ii)  there  are  sequences  am,  bm  satisfying  (83),  (84)  and 


(m=l,2,...).  (90) 

The  relation  (89)  is  a  contradiction  against  (88).  On  the  other  hand,  the 
simultaneous  validity  of  (87),  (83),  (84)  and  (90)  contradicts  the  fact  that 
TVQ  is  semi-open.  Thus  a  contradiction  follows  in  either  case,  and  theorem  3 
is  proved. 

22.   Suppose  that  the  hypotheses  of  theorem  4  hold.  We  may  assume, 
without  loss  of  generality,  that 


Since  the  sets  Iv  are  semi-open,  there  exists,  by  theorem  3,  an  at  most 
enumerable  set  N'  such  that 

SI  —  y  / 
•*i>  —  Zj  •*-}>• 


We  may  assume  that 


'  =  {1,2,3,...}. 


Unless  the  contrary  is  stated  the  letters  v,  A,  n  denote  typical  elements  of  JV '. 

Put  j  -  /    /  y  / 

Jx=  ^A  —  -*A  2j  V 

v<A 

Then  JA  JA,  =  0     (A  4=  A'). 

It  follows  from  (73)  and  (75)  that  there  exists  the  limit 

lim     S  !„!  =  £. 

71— >QO      V^n 

Hence  a  positive  number  n0  can  be  found  satisfying 


Then,  for  n  >  n0, 


n0 

S 

1 


Hence,  making  7i->oo, 
Also 


n0+l 


Kl' 


(92) 
(93) 


1  1  «0+1 

Let  r  be  the  smallest  number  such  that  there  are  r  indices  oip  satisfying 


ap 


(94) 
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Then  1  <  r  <  n0.  Furthermore, 


since  otherwise  r  could  be  reduced  to  r  —  1   by  the  omission  of  some 
index  ocp.  Hence  one  can  choose  numbers  xp  satisfying 

.  (95) 


By  numbering  suitably  one  can  obtain  a  case  in  which 

fljj  ^  3?2  ^  •  •  •  ^  Xp, 

Then  the  numbers  a1}  a2,  ...,  ar  have  the  property  required  in  theorem  4. 
For  we  have,  by  (91),  (93)  and  (94), 

S  i  =    S  i  =  S/.p+     S    •£-  (M) 

v^JV  v^JV'  p-1  i/=n0+l 

We  have  to  deduce  a  contradiction  from  the  existence  of  indices  p,  q 
satisfying 


Choose  a  number  x  such  that 

^-<4A- 

Then  an  argument  similar  to  one  used  at  the  end  of  §  17  will  lead  to  a  con- 
tradiction. 


Case  1.  Suppose  that  x  ^  xp+1.  Then 

3  <  XP+1  <  ^'      x>  XQ  X  4,»      «n+l  X  43» 

which  contradicts  (95). 

Case  2.  Suppose  that  #p+1  <  x.  Then 

^P  ^  XP+I  <  x>     xp>  x  X  ipj     ^D+I  ^  -4p 

which  contradicts  (95).  This  contradiction  establishes  (77).  Relations  (76) 
and  (78)  follow  from  (96)  and  (92)  respectively  if  one  puts 

^<A)  =  40+A     (A  =1,2,...). 
This  completes  the  proof  of  theorem  4. 

23.  It  is  easy  to  deduce  theorem  5  from  theorem  4.  Suppose  that  the 
hypotheses  of  theorem  5  hold.  By  theorem  4,  there  are  r  indices  ocp  and  sets 
J(A)  satisfying  (76),  (77)  and  (78).  A  suitable  numbering  of  the  indices  ap  will 

ensure  that  I    V   /    I  <  I   V  7    I 

I    2^   -x  I  ^  I   2j  •%  I  • 

p  even  •  p  odd 
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By  using  property  (71)  of  M  and  the  hypotheses  of  theorem  5,  we  see  that 
the  set  oo  r 

S  JW=  S  /„-  S4 

A  =  l  v^N  p  =  l      p 

belongs  to  M  .  Hence,  by  (78)  and  (79), 

I  S4I-I  S  4P+  S 

j/«(JV  peven  p  odd  A 

«l  S  4,1  +  1  S 

p  even  p  odd 

|  24 

p  odd 

<2|  S  4, 

podd     H 

and  so  |    2  /    |>J|   2  /,|-e. 

podd     p  v^^ 

Hence  theorem  5  follows  if  one  puts 


For  (81)  is  a  consequence  of  (77). 


The  University 
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THE  CENTRE  OF  FLEXURE  OF  A  HOLLOW  SHAFT 

By  A.  C.  STEVENSON 
[Received  2  December  194:6— Read  19  December  1946] 

1.   History  of  the  problem 

The  flexure  problem  for  a  hollow  tube  of  circular  cross-section  with  a 
cavity  of  circular  cross-section  placed  eccentrically,  was  unsolved  when  it 
was  suggested  in  1906  by  Lovef  that  the  classical  Saint-Venant  flexure 
functions  could  be  found  as  series  expansions  in  suitable  orthogonal  curvi- 
linear coordinates,  and,  taking  note  of  this  suggestion,  Young,  Elderton, 
and  Pearson  J  in  1918  wrote  down  the  form  of  the  solution  in  series  for  the 
case  where  the  load  is  at  right  angles  to  the  axis  of  symmetry  of  the  cross- 
section.  They  did  not  proceed  further,  however,  to  evaluate  explicitly  the 
coefficients  in  these  series,  and  to  complete  the  solution  by  finding  the  mean 
twist  of  the  cross-section  when  the  centroid  is  taken  as  the  load-point  of 
the  cross-section.  It  was  not  until  1936  that  Seth§  gave  the  solution  for  the 
Saint-Venant  flexure  functions  for  the  cases  where  the  load  is  resolved  along 
and  perpendicular  to  the  axis  of  symmetry,  and  the  associated  twist  due 
to  the  latter  resolute  when  the  centroid  is  the  load-point. 

In  this  paper  the  problem  is  solved  by  finding  the  canonical  flexure 
functions  and  moment  integrals  introduced  by  the  author  ||,  to  facilitate 
a  systematic  investigation  of  such  problems,  and  all  coefficients  of  the  series 
solutions  are  found  explicitly  and  the  associated  twist  is  determined.  The 
solution  of  the  problem  when  any  other  point  of  the  cross-section  is  the  load- 
point  depends  upon  the  superposition  of  the  solution  for  the  torsion  pro- 
blem, with  an  appropriate  additional  twist  proportional  to  the  distance  of 


t  Love,  Mathematical  Theory  of  Elasticity,  2nd  edition  (1906),  325. 
j  Young,  Elderton  and  Pearson,  Draper's  Co.  Research  Memoirs,  Tech.  series, 
no.  vn  (1918),  69. 

§  Seth,  Proc.  Ind.  Acad.  Sci.  4  (1936),  631,  and  5  (1937),  23. 
||  Stevenson,  Phil.  Trans.  Roy.  Soc.  237  (1938),  161. 
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the  load  from  the  centroid.  The  torsion  solution  for  this  cross-section  was 
given  in  1893  by  Macdonaldf. 

An  alternative  way  of  relating  the  constants  of  the  mathematical  solution 
with  the  external  force  system  is  to  give,  in  addition  to  the  torsion  solution, 
the  solution  when  the  load-point  is  chosen  so  that  the  mean  twist  of  the 
cross-section  is  zero.  This  load-point  is  termed  the  centre  of  flexure,  and 
Timoshenko  J  remarks  that  its  position  is  a  matter  of  practical  importance 
in  thin- walled  sections,  and  that  its  exact  theoretical  determination  has 
only  been  carried  out  for  one  or  two  cross-sections.  Another  load-point  of 
interest  is  the  centre  of  least  strain,  denned  as  the  load-point  which  results 
in  minimum  strain  energy  of  the  beam.  It  can  be  shown  that  the  coordinates 
of  this  centre  of  least  strain  can  be  obtained  from  those  of  the  centre  of 
flexure  merely  by  setting  the  elastic  constant  equal  to  zero.  In  compact 
sections  the  two  points  are  in  practice  barely  distinguishable.  General 
formulae  have  been  given  by  the  author  for  the  associated  twist  and  these 
centres  of  flexure  and  least  strain  in  terms  of  the  canonical  moment  in- 
tegrals, and  in  this  paper  the  numerical  values  of  the  associated  twist  and 
coordinates  of  the  centres  of  flexure  and  least  strain  are  given  for  a  compre- 
hensive range  of  values  of  the  radius  of  the  inner  cavity  and  the  displace- 
ment of  its  centre  from  the  centre  of  the  outer  boundary.  These  numerical 
results  are  especially  interesting  as  the  section  weakens  by  the  cavity 
becoming  larger  or  more  eccentrically  placed  or  both. 

Seth  did  not  consider  the  centre  of  flexure,  and  gave  numerical  results 
for  the  associated  twist  in  two  cases  only.  Neither  Macdonald  nor  Seth 
considered  especially  the  limiting  case  when  the  cavity  just  reaches  the 
surface  of  the  cylinder.  In  this  case  it  will  be  shown  that  the  moment  in- 
tegrals can  be  expressed  in  terms  of  trigamma  and  tetragamma  functions, 
which  have  been  extensively  tabulated.  The  results  of  this  paper  are 
accordingly  almost  entirely  supplementary  to  those  of  Seth  §. 


2.    Introductory  analysis 
as  we  employ  orthogonal 
=  ctanj£     (z  =  x  +  iy,  £  =  g  +  i^).  (2-1) 


For  these  cross-sections  we  employ  orthogonal  curvilinear  coordinates 
,  fi  defined  by 


t  Macdonald,  Proc.  Camb.  Phil.  Soc.  8  (1893),  62. 

j  Timoshenko,  Theory  of  Elasticity  (1934),  301;  Strength  of  Materials,  1  (1931),  195. 

§  Since  this  paper  was  written  (in  1938),  complex  variable  treatments  of  these 
problems,  of  considerable  interest,  have  been  devised.  See,  for  example,  R.  Capildeo, 
"On  the  torsion  function  in  Saint -Venant  flexure  with  shear,"  Ph.D.  thesis,  London 
University,  1948. 
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Then  the  curves  given  by  £  =  const,  are  a  set  of  coaxial  circles  with  z  =  ±ic 
as  common  points,  and  the  orthogonal  family  given  by  y  —  const,  are  a  set 
of  coaxial  circles  having  real  limiting  points  $1}  S2  at  z  =  ±  ic,  corresponding 
to  TJ  —  ±  oo.  The  ^-axis  between  the  limiting  points  is  given  by  £  =  0,  and 
outside  the  segment  of  the  y-axis  connecting  these  two  points  £  =  ±  TT, 
and  there  is  a  discontinuity  of  277-  in  crossing  the  y-axis  outside  this  segment. 
We  shall  take  the  boundary  of  our  cross-section  to  be  given  by  y  =  a, 
rf  =  /?,  with  /?  >  a.  These  two  non-intersecting  circles  will  be  assumed  to 
have  radii  a,  b  with  a>6,  and  to  have  their  centres  (71?  C2  at  a  distance 
/apart  (see  Fig.  1). 


The  radii  are  given  by 

a  as  ccosecha,     b  =  ccosech/?, 


(2-2) 


and  if  the  centres  of  the  circles  are  at  the  points  (0,  d),  (0,  d')  respectively, 

then 

(2-3) 


Hence 

and  we  write 


/=  d  —  d'  =  c(cotha  —  coth/?), 
/'  =  d  +  d'  =  c(coth  a  +  coth  /?). 


(2-4) 
(2-5) 
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From  these  we  deduce  that       //'  =  a2  -  62,  (2-6) 

4c*f*  =  (f+a  +  b)(f+a-b)(f-a-b)(f-~a  +  b),  (2-7)  • 
and  d  and  d'  are  given  in  terms  of/  by  the  equations 

2/,     d'  =  (a2-&2-/2)/2/.  (2-8) 


In  practice  we  are  given  a,  6  and  /,  and  c  is  then  calculated  from  (2*7). 
Since  a  and  /?  appear  in  the  combinations  a  ±  /?,  there  is  some  advantage 
in  numerical  calculations  in  using  the  equations 

sinh(/?-a)  =  c//a6,     sinh(/?  +  a)  =  c(a2  -  &2)/a&/.  (2-9) 

Now  equation  (2-1)  gives 

x  +  iy  =  c(sin£  +  isinh?;)/(cos£  +  cosh?/),  (2-10) 

and  this  can  also  be  rewritten  as 

x  +  iy  =  -ic{l- 


whence  a?  «  -  2c  2  (  -  1  )n  e~HJI  sin  ?ig,  (2-11) 

n=l 
/  «  \ 

i/  =  c  1  +  2  S  (-l)we-n^cos^[.  (2-12) 

\  n  =  l  J 


From  (2-10)  we  find  easily  that 


=  —  ic2  5-5  —  f  c2  coth  ?/  ^  —  c*x, 
2 


2/3  = 


and  then,  using  (2-11)  and  (2-12),  we  have 

J#3  =  c3  2  (-  l)n  {K^2  +  2)-^  coth  y}e-ni>  sin  wg,  (2-13) 

w=l 

Ji/3  _-  c3(J  +  Jcosech2?/) 

g,   (2-14) 
(2-15) 
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We  then  derive  also  the  following  useful  results 

oo 

-  %x,  (2-16) 


c2  S  (-l)nne-n>>8inng  =  Jcr-  =  -\xyy  (2-17) 

n=l  <??/ 

oo  a2~ 

^2e-n*  sin  n£  =*  -  Jc2  ^  =  \<?x  +  i(^3  -  3V)»  (2>  *  8) 

<ty 

*e-^oosng  =  t(y-c)f  (2-19) 

g^y 

»«-"*  cos  w£  =  -}c/  = 


(2-20) 
c3  S  (-l)*nae^coswg  =  Jc2^  =  Ky8  -  3a^)  -  Jc^.  (2-21) 

n-l  ^ 

3.    TAe  canonical  flexure  functions 

For  the  solution  of  the  problem  of  the  flexure  of  an  elastic  beam,  pos- 
sessing a  cross-section  having  the  i/-axis  as  an  axis  of  symmetry  and  under 
the  action  of  a  load  W  parallel  to  the  #-axis,  the  author  has  shown  that  we 
need  three  complex  functions  of  z  (  =  x  +  iy),  Q,1}  a)z,  o>3,  where  ^  =  Xi  +  *Ai  » 
a)r  =  <j)r  +  iilfr  (r  =  2,3),  such  that  the  plane  harmonic  functions  Xi*  &%>  fis 
satisfy  the  equations 

<>,  (3-1) 


-  to8  ==  const.,  (3-2) 

(3-3) 


round  the  boundary  of  the  cross-section,  n  denoting  the  normal  to  the 
boundary. 

In  virtue  of  (2-13)-(2-15)  these  boundary  conditions  become 

£y  °° 

—^  =  —  c3  2  (  —  1  )w  ^{i(^2  +  2)  —  w  coth  1}  —  cosech2  ?/}  e~ni>  sin  ?if  , 


°3  S  (-l)n{i(tta  +  2)-f  ttcoth7/-f  cosech2  ^e-71^  cos  Tig  +  const., 

n=l 
oo 

°2  S  (  ~  1  )n  2  coth  T/  e-71^  cos  n£  +  const. 
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We  assume  suitable  plane  harmonic  forms  for  %1?  ^2,  ^3,  namely, 

Xi  =  c3  S  (  -  1  )n  {A  n  +  A'n  e?nri}  e-n*  sin  ng, 

w=l 

9v  °° 

so  that  -      =  -c3  S  (-l)nnA-A'e2nife 


also  V^2  =  c3  2  (  -  *  )n  {^n  +  ^n  e2w*}  e~n*  cos  ^£> 


The  boundary  conditions  will  be  satisfied  along  ij  —  a  and  7)  =  fl  if 
^w  -  ^  J,  ezna  =  %(n2  +  2)  -  n  coth  a  -  cosech2  a, 
An  -  A'n  e2nfi  =  %(ri*  +  2)  -  n  coth  /?  -  cosech2  ft, 
Bn  +  5^  e2wa  =  J(n2  +  2)  +  n  coth  a  +  cosech2  a, 

n^  =  J(^2  +  2)  +  n  coth  /?  +  cosech2  ft, 

™  =  2  coth  a, 


Solving  these  equations  for  the  constants  Ant  A'nt  Bn,  B'n,  Cn,  C'n  we  find, 
on  making  use  of  (2-2)-(2-5),  that 


where  aw  =  ncoth?i(/?  —  a),     a^  =  we~n<a+^)  cosech  n(fi  —  a),  (3-4) 

6n=     oothw(^-a),     6^=     e-^a+^>  cosech^  -a).  (3-6) 

Then  with  the  aid  of  (2-16)-(2-21)  we  find  that 

Xl  =  Xll  +  X12>       V^2  =  fal  +  ^22>       ^3  =  ^ 
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where,  dropping  irrelevant  additional  constants, 

Xu  =  A<*»  -  3*2/2)  +  If'xy  +  i*(»2  +  62  -  c2),  (3-6) 


Xiz  =  -  J/c2  S  (-l)nKe 

n=l 

-  i//'c  S  (-  l)w  {6ne-^  +  6;e^}sinrig,       (3-7) 


'c  S  (-l)»{6ne-«»-6;^}oosnf,       (3-8) 

(3-9) 


^8i  =/c  2  (-l)w{6ne-»*-&;e"i}cosw£t  (3-10) 

71=1 

*n  =  U'y,  (3-n) 

and  hence     Q!  =  nn4-ni2,     w2  =  w21-|-a>22,     <w3  =  w314-w32> 

where          Qn  =  ^u  +  *A3i  =  A*3-^/'  *2  +  j2(a2  +  62-c2),  (3-12) 

(3'  1  3) 
(3*14) 

and  ^21  =  Xi2  =  i/'&i-i/c2  S  (-  l)n{ane-»^  +  a;e^}sinng,   (3-15) 

w=l 

^31  =  ~/C  I)  (  ~  1  )n  {^>n  e~^  +  b'n  ^}  S.in  wf-  (3*  J  6) 

n=l 

4.    TAe  canonical  moment  integrals 

We  next  find  certain  moment  integrals  taken  over  the  cross-section  and 
obtained  from  the  canonical  flexure  functions  by  the  following  formulae  : 


(4-1) 
(4-2) 
(4-3) 

where  L[  =  real  part  of    iz  -^dSt  (4-4) 

J      az 

M'r  =  real  part  of  [izdS     (r  =  1,  2),  (4-5) 
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which  can  be  expressed  alternatively  in  the  form  of  line  integrals  round  the 
boundary  of  the  cross-section  as 


(ydS 


M'r=  h(*  +  lf)Qd8.  (4-7) 

J  OS 

For  this  cross-section  there  is  no  difficulty  in  obtaining  the  following  results: 

S  =  n(a2~b2),  (4-8) 

(x*dS  =  i7r(a4-64),  {(x2-y2)dS  =  ~7r(aW-62^2),     (4-9) 

(x*ydS  =  %7T(a*d-bW),      ((yz-Zx*y)  dS  =  7r(a2dB-b2d'3).      (4-10) 

We  write  L[  —  L'1±-\-  L'12,  M'2  =  M2l  +  M22,  M'3  =  M31-t-M'32  to  correspond 
with  the  splitting  up  of  x\>  ^2  an(l  ^3>  an(^  then  use  (4*4)  and  (4«5)  for  the 
evaluation  of  L'n,  M22,  M^2)  and  (4-6)  and  (4'7)  for  the  evaluation  of 
L'12,  M21,  M'31.  We  find  readily,  making  use  of  (4-8)-(4-10),  that 

^ii  =      %n(a2d3  -  b2d'3)  -  %7rf'(a2d2  -  b*d'2)  -  J7r(a2  +  62  -  c2)  (a*d  -  bzd'), 

2  +  62  4-  c2)  (a*d  -  b2d'), 


Now  from  (4-7),  using  (2-15)  and  (3-16), 


=  -4?r/2c2  2  <-27r/c2^  S  ne-2n«-27Tfc2d'  2  we-2^. 

n=l  n  =  1  ?i=l 

But  S  7ie~2^  =  J  cosech2  x,  (4-11) 

?i=i 

whence,  making  use  of  (4-11)  and  (2-2), 


Hence,  adding  MBlt  M32)  and  using  (2-9),  (4-3)  and  (4-9),  we  have 

M3  =  J77(a4  -  64)  -  47ra2&2#1,  (4-12) 

where  ^  =  sinh2  (ft  -  a)  S  ^e-w<a+^>  cosech  n(yff  -  a).  (4-  1  3) 
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This  agrees  with  Macdonald's  result  for  the  torsion  moment 

In  similar  fashion  we  readily  find  from  (4-6)  and  (4-7),  using  (2-16)  and 
(3-15),  that 


na'n-7rdfc*  2  nze-*™-7rd'fc9  2  n 

n  —  \  n=l  «.=! 

But  from  (4-11)  we  deduce  that 

2  ft2e-2ux  =  Jcotha  cosech2^;  (4-14) 

n=l 

hence,  substituting  for  Jf^,  using  (4-14)  and  adding  Jf^i'  -^2a»  we  now  ^n(^ 


and  so  from  (4-2)  and  (4-10),  using  (2-2)  and  (2-3),  we  obtain 

--*~ 


,  (4-15) 

where  $t  is  given  by  (4-13)  and  82  by 

$2  =  sinh3  (/ff  -  a)  S  n*e-***+ffi  cosech  TI(/?  -  a).  (4-16) 

n«=l 

Similarly,  adding  i/^  and  L{2,  and  making  use  of  (4-1),  (4-10)  and  (4-15), 

we  find  that  ,-         __.        .   .,     iA-it\  it  -»n\ 

LI  =  M2-rr(a*d-b*df).  (4-17) 

We  can  find  alternative  expressions  for  the  series  8^  and  >Sa  of  positive 
terms,  since 

S  nern^+&  cosech  n(/3-  a)  =  2  £  ne-^/(l 

n=l  n=l 


or,  from  (4-11), 

fli  =  is 

In  similar  fashion  we  find,  with  the  aid  of  (4-14),  that 


-a)  2  cosech2  [w(/ff-«  )  +  /?].  (4-18) 

w=0 
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To  express  a*d  —  b*d'  in  terms  of  a,  b,  /,  we  find  from  (2-8) 


a*d  -  b*d'  =  i/(a*  +  &*)  +      (a2  +  62)  (a2  -  62)2.  (4-20) 

It  is  also  possible  to  express  the  terms  of  the  series  S±  and  Sz  explicitly 
in  terms  of  a,  b  and  /,  since  we  can  show  that 

1 
Wn-^n-i),  (4  21) 


sinh  (/?-<*)       ~  /n-n-i, 
cosh  |>(/?  -  a)  +  ft]  =  |  Qn  -  -a  Qn_v  (4-22) 

J 

in  which  Qn  is  a  polynomial  in  x,  where 

x  =  2  cosh  (£-  a)  =  (a2  +  62  -/2)/a&,  (4-23) 

and  Q«  is  given  by 


_ 

»-  1!  2! 

(4-24) 

We  replace  d  and  ^'  in  (4-22)  by  their  explicit  values  from  (2-8).  After  the 
first  few  terms  of  the  series  the  explicit  expressions,  in  terms  of  a,  b  and  / 
only,  rapidly  become  unwieldy,  however,  and  computation  of  8±  and  $2  is 
best  effected  by  finding  c  from  (2-7),  given  a,  6  and/,  and  then  deducing 
(2-9). 


5.    Special  case  —  circular  boundaries  in  contact 

When  the  cylindrical  cavity  just  reaches  to  the  boundary  of  the  cylinder, 
the  solutions  found  for  the  moment  integrals  require  special  treatment 
before  they  can  be  of  direct  numerical  use.  This  case  is  obtained  by  letting 
c,  a  and  ft  all  tend  to  zero  in  such  a  way  that 

lim  c  cosech  a  =  a,     lira  c  cosech  ft  =  6. 
In  this  case/  =  a  -6,  and  from  (4-23)  and  (4-24) 

lim  x  =  2,     lim  Qn  =  n  4-  1  , 

a;-^2 

so  that  (4-21)  and  (4-22)  give 

Binh-a)]^  o 


__ 
a-b 
SHB.  2.    VOL.  50.    NO.  2413.  KK 
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We  can  make  the  term-by-term  passage  to  the  limit  for  the  series  S1  and  S2, 
since  they  can  be  shown  to  be  uniformly  convergent  with  respect  to  x. 
We  have  then  n 

(5-1) 


where  ^'(z)  is  the  trigamma  function,  and 

(6-2) 


where  y^"(z)  is  the  tetragamma  function. 

Hence  putting  6  =  Aa  (A<1),  we  find  from  (4-12),  (4-15)  and  (4-17), 
using  (5-1)  and  (5-2),  that 

MB  =  i7r 


-A5),  (5-4) 

(5-6) 


which  are  readily  calculated  at  once  from  the  tablesf  for  these  functions. 

6.    The  associated  twist 

The  author  has  previously  shownj  that  the  associated  twist  per  unit 
length  (T)  for  a  uni  -axial  cross-section  is  given  by 

T=  (W/EI){A  +  By},  (6-1) 

where  E,  ij  are  Young's  modulus  and  Poisson's  ratio  respectively,  /  is  the 
second  moment  of  the  cross-section  about  the  axis  of  symmetry,  W  being 
the  load  parallel  to  the  #-axis,  the  load-point  being  the  centroid  (0,  k)  of 
the  cross-section,  and  A  and  B  are  given  in  terms  of  the  canonical  moment 
integrals  by  the  formulae 

A  =  -(£1+2&7)/J/3>  (6-2) 

B  =  -  (Li  -  M2  +  kM3  +  2M)/MZ.  (6-3) 

We  express  all  lengths  in  terms  of  the  radius  a  of  the  outer  boundary, 
putting  b  —  Aa,  /  =  /*«,  and  construct  tables  in  terms  of  A  and  /*  in  steps 

t  Davis,  Tables  of  the  Higher  Mathematical  Functions,  2  (1936),  Principia  Press 
(U.S.A.)'  British  Association  for  the  Advancement  of  Science,  Mathematical  Tables, 
1  (1931). 

J  Stevenson,  loc.  cit.t  173. 
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of  0-1.  Table  I  gives  the  torsion  moment,  Tables  II  (a)  and  (6)  give  A  and  B 
for  the  associated  twist. 

7.    The  centres  of  flexure  and  least  strain 

The  load-point  when  the  mean  twist  taken  over  the  cross-section  is  zero, 
i.e.  the  centre  of  flexure,  has  been  shownf  to  have  coordinates  (0, g0) 
given  by  g<>  =  _  [L^^-kM^I,  (7-1) 

where  cr  is  the  modified  Poisson's  ratio  given  by  (1  +  ?y)  (1  —  cr)  =  1. 

For  these  cross-sections  we  shall,  however,  compute  the  distance  $  of 
the  centre  of  flexure  from  the  centre  of  the  outer  circular  boundary,  mea- 
sured positively  away  from  the  centre  of  the  inner  boundary,  so  that 

S  =  gQ-d,     or     $  =  p  +  (rq,  (7-2) 

where  p  =  -  (LJ2I)  -  (/a  +  a2  -  62)/2/,  (7-3) 

g  =  (Jfa-fcMi)/2I.  (7-4) 

The  point  (0,^>)  is  the  load -point  which  results  in  the  strain-energy  of  the 
elastic  cylinder  being  a  minimum  (the  centre  of  least  strain).  Table  III  (a) 
gives  the  position  of  this  centre  of  least  strain.  Table  III  (6)  shows  the 
possible  difference  in  position  between  this  point  and  the  centre  of  flexure. 
Table  IV  gives  the  position  of  the  centroid  of  the  cross-section  for  com- 
parison, and  y  is  measured  in  a  fashion  similar  to  8,  i.e.  y  —  k  —  d. 

Changes  are  rapid  in  the  last  interval  in  row  or  column  for  A,  B,  p  and  q, 
and  computation  becomes  laborious.  In  Table  III  (6)  the  last  entries  in 
a  row  or  column  change  sign,  except  in  the  column  A  =  0- 1 ,  where  we  have 
evidence  of  the  change  of  sign  occurring  before  /i  =  0-8.  To  show  the  decrease 
in  the  tabulated  quantity  —  q/a  for  /*  =  0-1  between  A  =  0-8  and  0-9,  we 
calculate  for  the  case  /*  =  0-1,  A  =  0-875,  finding 

MS  =  0-39448,     p/a  =  0-62072,      -q/a  =  0-05530, 

showing  that  ( —  q/a)  has  passed  its  maximum  for  variation  with  respect  to  A. 
The  limiting  case  as  A  and  fi  tend  to  unity  and  zero  respectively  is  of 
interest,  and  leads  to  the  value  d  —  l-5a,  which  may  be  contrasted  with  the 
known  result  for  the  limiting  case  for  the  split  ring  section  of  uniform 
thickness  (tending  to  zero)  and  of  radius  a,  which  leads  to  £  =  2-Oa.  A  differ- 
ence is  to  be  expected,  of  course;  for  example,  the  infinitely  thin  straight 
line  segment  can  be  approached  alternatively  from  a  rectangle  or  from  a 
circular  sector,  in  the  first  case  the  centre  of  flexure  is  at  the  midpoint,  in 
the  second  it  is  at  a  distance  0-$a4-crO-26a  from  the  apex  end. 

t  Stevenson,  Zoo.  tit.,  176. 

KK2 


548 


A.  C.  STEVENSON 


[19  Dec, 


The  torsion  moment: 
TABLE  I. 


\  A 
/<\ 

0-1 

0-2 

0-3 

0-4 

0-5 

0-6 

0-7 

0-8 

0-9 

0-1 

1-57000 

1-56566 

1-55178 

1-51857 

1-45152 

1-33135 

1-13166 

0-80794 

0-00497 

0-2 

1-56810 

1-56777 

1-53272 

1-48190 

1-38619 

1-21728 

0-91949 

0-03741 

— 

0-3 

1-56492 

1-54452 

1-49974 

1-41824 

1-26839 

0-99003 

0-11786 

__ 

— 

04 

1-56044 

1-52565 

1-45309 

1-32140 

1-07101 

0-25806 

— 

— 

— 

0-5 

1-55465 

1-50063 

1-38732 

1-17162 

0-45956 

— 

— 

— 

— 

0-6 

1-64745 

1-46793 

1-29065 

0-71239 

— 

— 

— 

— 

— 

0-7 

1-53861 

1-42201 

0-99382 

— 

— 

— 

— 

— 

— 

0-8 

1-52703 

1-26736 

— 

— 

— 

— 

— 

— 

— 

0-9 

1-48180 

__ 

— 

— 

_ 

_ 

— 

__ 

— 

The  associated  twist:  T  =  (If  I  El)  (A  +  By) 
TABLE  II  (a).  A/a 


Y 

/*  \ 

0-1 

0-2 

0-3 

0-4 

0-5 

0-6 

0-7 

0-8 

0-9 

0-1 

0-00199 

0-00799 

0-01848 

0-03309 

0-05429 

0-08610 

0-13923 

0-26539 

102-2613 

0-2 

0-00393 

0-01588 

0-03716 

0-06783 

0-11511 

0-19437 

0-37569 

21-2836 

_ 

0-3 

0-00577 

0-02357 

0-05724 

0-10682 

0-19468 

0-39658 

7-64248 

— 

— 

0-4 

0-00734 

0-03101 

0-07695 

0-15594 

0-33327 

3-33836 

— 

— 

— 

0-5 

0-00894 

0-03835 

0-10110 

0-22729 

1-57137 

— 

— 

— 

— 

0-6 

0-01020 

0-04597 

0-13513 

0-74160 

— 

— 

— 

— 

— 

0-7 

0-01129 

0-05641 

0-32672 

— 

— 

— 

— 

— 

— 

0-8 

0-01254 

0-11926 

— 

— 

— 

— 

— 

__ 

__ 

0-9 

0-02557 

— 

— 

— 

— 

—  • 

— 

— 

— 

TABLE  11(6).  B/a 


\ 

/*  \ 

0-1 

0-2 

0-3 

0-4 

0-5 

0-6 

0-7 

0-8 

0-9 

0-1 

0-00002 

0-00010 

0-00048 

0-00068 

0-00161 

0-00413 

0-01166 

0-05097 

102-4973 

0-2 

0-00017 

0-00079 

0-00272 

0-00569 

0-01400 

0-03744 

0-13243 

21-5023 

— 

0-3 

0-00057 

0-00271 

0-00942 

0-02098 

0-05576 

0-18512 

7-84056 

— 

— 

0-4 

0-00137 

0-00664 

0-02166 

0-05778 

0-18321 

3-51214 

— 

— 

— 

0-5 

0-00270 

0-01367 

0-04696 

0-13318 

1-71730 

— 

— 

— 

— 

0-6 

0-00477 

0-02554 

0-10014 

0-85643 







— 

— 

0-7 

0-00786 

0-04767 

0-40792 











— 

0-8 

0-01286 

0-16603 













— 

0-9 

0-04142 

— 

— 

— 

— 

— 

— 

— 

— 
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The  centre  of  flexure:  d^ 
TABLE  m(o).  p/a 


\  A 
P\ 

0-1 

0-2 

0-3 

0-4 

0-5 

0-6 

0-7 

0-8 

0-9 

0-1 

0-00300 

0-01215 

0-02829 

0-05188 

0-08684 

0-14009 

0-22808 

0-40898 

1-36628 

0-2 

0-00594 

0-02411 

0-05633 

0-10377 

0-17502 

0-28555 

0-48156 

1-21401 

— 

0-3 

0-00878 

0-03571 

0-08477 

0-15612 

0-26768 

0-45592 

1-04282 

— 

— 

04 

0-01133 

0-04684 

0-11133 

0-21082 

0-37572 

0-85510 



— 

— 

0-5 

0-01390 

0-05749 

0-13947 

0-26922 

0-65704 



— 



— 

0-6 

0-01611 

0-06803 

0-17128 

0-45945 

_ 



— 

— 

— 

0-7 

0-01813 

0-08031 

0-27763 











— 

0-8 

0-02027 

0-12971 









_ 

— 

— 

0-9 

0-03321 

— 

— 

_ 

— 

— 

— 

— 

_ 

TABLE  111(6).  -q/a 


\A 
t\ 

0-1 

0-2 

0-3 

0-4 

0-5 

0-6 

0-7 

0-8 

0-9 

0-1 

0-00197 

0-00784 

0-01792 

0-03216 

0-05193 

0-07981 

0-12095 

0-18680 

-0-00389 

0-2 

0-00376 

0-01499 

0-03388 

0'06016 

0-09519 

0-13972 

0-18739 

-0-00825 

— 

0-3 

0-00518 

0-02054 

0-04603 

0-07954 

0-11965 

0-15312 

-0-01956 

—_ 

— 

0-4 

0-00593 

0-02371 

0-05157 

0-08474 

0-19697 

-0-03280 

— 

— 

— 

0-5 

0-00617 

0-02362 

0-04821 

0-07204 

-  0-04554 

— 

— 

— 

— 

0-6 

0-00536 

0-01912 

0-02898 

-0-05345 

— 

— 

— 

— 

— 

0-7 

0-00336 

0-00792 

-0-05179 

— 

— 

— 

— 

— 

— 

0-8 

-0-00032 

-0-03780 

— 

__ 



— 

__ 

— 

— 

0-9 

-0-01495 

— 

— 

— 

— 

__ 

— 

— 

— 

The  centroid:  y  -  k-d 
TABLE  IV.  y/a 


Y 

fi\ 

0-1 

0-2 

0-3 

0-4 

0-5 

0-6 

0-7 

0-8 

0-9 

0-1 

0-00101 

0-00417 

0-00989 

0-01905 

0-03333 

0-05625 

0-09608 

0-17778 

0-42632 

0-2 

0-00202 

0-00833 

0-01978 

0-03810 

0-06667 

0-11250 

0-19216 

0-35556 

— 

0-3 

0-00303 

0-01250 

0-02967 

0-05714 

0-10000 

0-16875 

0-28824 

— 

— 

0-4 

0-00404 

0-01667 

0-03956 

0-07619 

0-13333 

0-22500 

— 

— 

— 

0-5 

0-00505 

0-02083 

0-04945 

0-09524 

0-16667 

— 

— 

— 

— 

0-6 

0-00606 

0-02500 

0-05934 

0-11429 

— 

— 

— 

— 

— 

0-7 

0-00707 

0-02917 

0-06923 

— 

— 

— 

— 

— 

— 

0-8 

0-00808 

0-03333 

— 

— 

— 

— 

— 

— 

— 

0-9 

0-00909 

— 

— 

— 

— 

— 

— 

— 

— 
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ON  DIOPHANTINE  APPROXIMATION  TO  CERTAIN 
EXPONENTIAL  AND  BESSEL  FUNCTIONS 

By  H.  TODD 
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1.  The  problem  of  simultaneous  rational  approximations  to  several 
irrational  numbers  is  one  of  considerable  interest  and  importance.  The 
fundamental  existence  theorem  is  due  to  Dirichlet,  who  proved  (1)  that, 
if  #!,  #2,  .  .  .  ,  68  are  any  irrational  numbers,  there  exists  an  infinity  of  integer 
sets  (plt  p2,...,  pa,  q)  satisfying 


with  c  —  1.   It  may  be  shown  that  the  index  I  Is  is  the  best  possible  for 
arbitrary  6it  but  the  best  possible  value  of  c  is  not  known  for  s^2. 

No  simple  algorithm  is  known  for  finding  integers  q  which  satisfy  the 
above  inequality.  However,  the  theorem  quoted  implies  the  weaker  result 
that  there  exists  a  sequence  of  integers  (tn)  such  that  the  difference  between 
tn6i  and  the  nearest  integer  to  it,  for  i  —  1,  ...,#,  tends  to  zero  as  TI->OO. 
For  some  particular  irrationals,  we  may  find  such  sequences  (tn)  by  special 
methods.  Jacobi  showed  (2),  by  a  generalization  of  the  continued  fraction 
algorithm,  how  this  problem  could  be  solved  for  the  numbers  0j_  =  $pt 
6%  —  tfP2  (#  =  2,3,5).  For  example,  he  proved  that  for  the  numbers 
^  =  8/3,  #a  =  $9  we  may  take  the  sequence  (tn)  defined  by  the  difference 


with  t_2  =  —  4,  t_1  =  1,  £0  =  0  as  initial  values  for  tn. 

In  this  paper  it  is  shown  that  solutions  of  a  certain  type  of  linear  difference 
equation  of  order  5+1  provide  simultaneous  rational  approximations  to  s 
irrationals,  which  are  determined  by  the  coefficients  of  the  difference  equa- 
tion and  the  initial  values  chosen  for  the  formation  of  the  solutions.  In 
particular,  the  irrationals  thus  defined  may  be  simple  combinations  of 
exponential  or  of  Bessel  functions,  and  some  examples  of  this  type  are  given 
to  illustrate  the  general  theory. 

The  author  would  like  to  express  his  thanks  to  the  referees  who  have 
greatly  helped  in  the  presentation  of  the  results. 
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2.   The  difference  equations  considered  in  this  paper  are  of  the  form 


in  which  the  coefficients  bn  r  are  polynomials  in  n  defined  by 

bn>r  =  tean_rlr\a?     (r  =  0,  ...,*),  (2-2) 

where  Aaw  =  an+1  —  an,  and 

an  =  a*n'  +  fan*-1  +  ...+&,  (2*3) 


where  a,  /?15  ...,  fl8  are  integers,  a>0  and  an=t=0  for  any  integer 
Clearly  6W>0  =  aw  and  bn>8  =  1. 

It  will  be  shown  that,  if  pn,  tn  are  any  solutions  of  (2-1),  then  |  pn  —  tnO  \  ->  0 
as  n  ->  oo,  where  0  is  a  number  depending  on  the  choice  of  an  and  on  the 
initial  conditions  defining  the  solutions  considered.  Since  the  difference 
equation  (2-1)  will  have  s-h  1  linearly  independent  solutions,  corresponding 
to  a  given  solution  tn  there  will  be  s  sequences  (pn)  and  so  s  numbers  6. 
In  order  to  prove  this  result  and  to  determine  the  numbers  6,  we  need  to 
introduce  some  auxiliary  functions  defined  as  follows: 


lftn,  r)  =  an  .  .  .  an+r/-«-  ±  -  "+^>m_  /n  }          (2.5) 

\  m=Qanan+l'"an+m 

for  r  —  0,  ...,5,  with  &(n,  —  1)  =/n_«_i, 


with  the  same  definition  of  PM  in  terms  of  (pn),  where  (pn)  is  any  other 
sequence  of  integers  satisfying  (2-1). 

We  may  now  state  the  principal  result  of  this  paper  in  the  form  : 

THEOREM.  The  difference  equation  (2-1)  Ms  s+l  linearly  independent 
solutions  tn,pn,qn>  ...}in  integers  such  that 

-i+e 

\pn-tnPQIT0\<A(e)tns    ,  (2-7) 

where  e  is  an  arbitrary  positive  number,  and  A(e)  is  independent  of  n  and 
depends  only  on  e;  with  similar  relations  for  the  other  solutions  qnt  .... 

3.  The  proof  of  the  theorem  may  conveniently  be  made  to  depend  on 
a  number  of  lemmas.  We  first  collect  together  some  simple  results  which 
are  required  later. 
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LEMMA  1. 

t(n-r,r)  =  ani/r(n-r,r-l)-bnirfn_s    (r  -  0,.  ...«),  (3-1) 

jfr(»,«)  =  0,  (3-2) 

^(n-s,  a  -!)=/„_>„,  (3-3) 

lKn-r,r)~     S          6n-r+m,».      /m_r_<+m.  (3-4) 

m=r-|-l  "71+1  •••  ^n-f-m—  r 

The  relation  (3-1)  follows  immediately  from  the  definition.  To  prove 
(3-2)  we  note  that 

00 

fn-s-l-fn-8  =   H  K+r-aJ/K-.-^+rf'la'-), 
r=l 

ir\ 
whilst  an+r  =  an  +  r  Aan  +        A2an  +  .  .  .  +  Aran, 

\^7 

s         /, 

an  fVinf  f  —    V         ^n+m.  m       f 

SO  tnat  /n-s-l  —    2j    -  -  -          /n-s+m' 

m=0«w  •••^n-fm 

which  is  simply  (3-2)  written  in  full.  The  relation  (3-3)  follows  at  once  from 
(3-1)  and  (3-2),  and  finally  (3-4)  is  derived  from  (2-5)  and  (3-2). 

LEMMA  2.  Tn  -  anTn_v.  (3-5) 

For 


=  0, 

since  tn  satisfies  (2-1). 

LEMMA  3.  There  exists  an  integer  n0  such  that,  for  appropriate  initial  values 
t0,  t_lf  ...,  t_8,  we  have 

for  all  n  ^  n0. 

We  may  show  by  considering  the  difference  equation  that  tn  is  ultimately 
of  fixed  sign,  and  this  will  be  positive  if  the  initial  values  are  suitably  selected. 
Again,  we  may  clearly  take  n  large  enough  to  make  i/r(n  —  r,r)>()  for 
r  =  0,  ...,a—l,  and  i/r(n+  1,  —  1)  =  fn_s>  1.  Then  for  n^n0,  say,  all  the 
terms  of  the  expression  (2-6)  for  Tn  are  positive,  so  that  Tn  >  tn  >  0. 

LEMMA  4.  &-      =  --L-  'sVilft*  -'*').  (3*6) 
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From  Lemma  2  we  have 


and  so  Pnltn  -  P0IT0  =  pjtn  -  PJTn 

=  (PnTn-tnPn)ltnTn 


r=0 

We  must  now  investigate  the  behaviour  of  the  right-hand  side  of  (3-6) 
for  large  n,  and  in  particular  that  of  ur.  The  estimate  for  ur  is  deduced  from 
one  for  %  =  pntn_l  —  2V_i£n,  anc^  *°  obtain  the  latter  we  now  require  a 
general  theorem  from  the  theory  of  difference  equations;  this  we  shall  state 
as  a  lemma,  referring  the  reader  to  the  literature  for  the  proof  (3). 

LEMMA  5.    Let 


,  .  .  .  „  /0  _. 

yn  4-  a±(n)  yn_±  +  .  .  .  +  oc8(n)  yn_9  =  0  (3-7) 

be  an  equation  of  Poincare's  type,  that  is,  let  lim  oc^n)  =  at  exist  for  i  —  1  ,  .  .  .  ,  s. 

W->00 

Let  qlt  ...,^0-  be  the  distinct  moduli  of  the  roots  of  the  associated  characteristic 
equation  ts  +  ^  ts-i  +  .  _  t  +  ^  =  0> 

and  let  ZA  be  the  number  of  roots  whose  modulus  is  qx,  multiple  roots  being  counted 
multiply.  Then  provided  a8(n)^=0  for  integral  n,  the  difference  equation  (3-7) 
has  a  fundamental  system  of  solutions  which  fall  into  o"  classes,  such  that,  for 
the  solutions  of  the  Xth  class  and  their  linear  combinations, 


The  number  of  solutions  of  the  Ath  class  is  ZA. 

LEMMA  6.  |  %  |  =  {bn>1}  (1  +  en)«/««,  (3-8) 

wAcre  {bn>  J  =  bn>  ^  bn_lf  1...bk>1, 

k  being  any  integer  independent  of  n,  and 

lim  en  =  0. 

W-^-QO 

Since  pn,  tn  satisfy  (2-1),  it  follows  (4)  that  vn  =  Pn/tn—Pn-iltn-i  satisfies 
the  equation 


,-+tn-8-l)vn-2+'--+tn__8_1Vn_a  =  0. 

If  we  substitute 


we  obtain       yn  +  **&f±^  y^ 

un  1  cw-2 
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Now  &w,1~6n-i,i~  - 


and  £n_x  -  (arc)8  tn_2  ~  (arc)2*  ^_3  - . . . ; 

it  follows  that  (3-9)  is  of  Poincare's  type,  with  associated  characteristic 
equation 

y*  +  y8-1  +  ( l\  y«-2$~2  + . . .  +  s-°  =  0, 


Hence,  by  Lemma  5,  lim  #|  yn  \  =  ljs, 

so  that  we  may  write  |  yn  \  —  (l+en)n/sn, 

where  lim  en  —  ®- 

It  follows  that  |  %  |  =  |  Pnt^i  -pn^ 


and  the  lemma  is  proved. 

LEMMA  7.      |  ur  \  <A{bn>1}  (1  +^n)»»'-V-»     (r  =  1,  ..., 
tuyere  ^4  t5  independent  of  n  and  yn  ->  0  as  n  ~>  oo. 

Writing  <on  =  pntn^-pn~^n  =  ^i>  we  have 


,  . 

"i   •  •  •  ~r  ~ 


If  we  define  yn  =  max(0,en,  ...,en_8+1),  and  put  nn  =  {^n>1}5-w,  we  have 
?/n  -^  0  as  n  ->  oo,  and 


Further,  flrn_i  =  &nnlbnt  x  <  ^tt1-^, 

where  ^41?  ^12>  •  •  •  »  occurring  here  and  later  denote  numbers  independent  of  tt. 
Hence  7rn_m<A2n1in(:L-8)7rn  for  any  fixed  ra.  Recalling  that  fn  ^  (an)™8  tn_m 
it  follows  that  |  ur  \  <  Anr~lnn(l  +  9/n)n,  which  is  the  required  result. 
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LEMMA  8.  {6n>  x}  <  A  (k)  snn^~8^  &~  D/8 

for  any  fixed  integer  k>nQ  +  s,  where  A(k)  is  independent  ofn. 
Direct  calculation  gives 


(w^+n«^[(«-  1)  A/a--  i^(«-  1)]+  ...), 

SO  that  jWlV-ia-A^+1,1  <  ^(1  +  48/n»). 

«w_l  •  •  •  #n-s+l 

Writing  0n  for  the  expression  on  the  left-hand  side  of  this  inequality,  we  have 

*+v  ft  (l+A9lm*) 


since  fl  (1  +A^m2)  converges.  This  may  be  written 

A         A2  /,3  Iv«  /,«  ^8 

°n,  1  pn-i,  1  °n-2,  1  •  •  •  °n-a+l»  1  °n  -af  i  •  •  •  °fe,  l 

2  g  _  i         '~"g  _  i        ~     g—  ~i       ~      ^- 

an-lan-2  •••  aw-s+la?i-8  •••  ak 

or  {6nrl}g        4      ^  ^7l  6^2lt  1  •  •  •  ^^-a+2. 

1/7  V~l          5  ns-i-na-*        n 

\anj  an      O<n-2--  an~8+2 


since  bn>  1/aw  ~  s/an. 

Now  let  us  fix  k>n0  +  s,  and  so,  in  particular,  6n>r>0  (r  =  0,  ...,«)  and 
^w>0  for  n^k  —  s—  1.   Hence 


so  that 

It  follows  then  that  {6W  J  < 
LEMMA  9.  ^(n 

From  equation  (3-4)  we  have 

#(n  -  r,  r)  ~      _        (an 


LEMMA  10.  i%s>A%a,nn\, 

where  A,  B  are  independent  of  n. 

In  Lemma  8  we  proved  that  tn  >A(k)  {&n}.  Now 


giving      log  {aj  =  2  log  ar  =  S  *  log  ar  +  H  log  (  1+  ftjra*  +  ...). 

r>=k  r=k  r=k 
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But  Slog(l +&/**+. ..)~~k>gn,        if  A*0 


~  constant,     if  p±  =  0, 
and  the  result  follows. 

Proof  of  the  theorem.  Using  Lemmas  3,  4,  7,  8  and  9,  we  have 

s-l 
n  r-0 


But 

for  any  positive  e,  and  the  theorem  follows. 

4.  Special  cases.  Changing  the  notation  slightly  by  writing  s  —  1  in  place 
of  s,  and  putting  a  =  sm,  an  =  m8~^(sn—  \)(sn  —  2)  ...  (sn  —  s+  1),  we  see 
that  the  conditions  of  the  theorem  are  satisfied.  Further,  the  functions 
/-i>  /-2>  •••»  f-s  can  a^  ^e  expressed  as  linear  homogeneous  functions  of 
exp  (tt)k/m)  (k  —  0,  .  .  .  ,  ,9  —  1  ),  where  a>  is  a  primitive  sth  root  of  unity,  though 
the  explicit  formulation  is  laborious  for  large  s.  We  give  examples  of  the 
cases  s  —  2,  3,  4. 

(i)  5  =  2.  Here  we  have 

tn  =  m(2n-l)tn_i  +  tn_2y 

1  "11 

To  =  *o/-i  +  -  Wo  =  £0coshL  +«_1sinh-  . 
Utvim1>/00         m        x         m 

Thus,  if  we  take  the  initial  values 


we  have  P0  =  cosh  1/m,  T0  =  sinh  1/m,  and  we  obtain  the  known  result  (5) 


,  ,    1  11 

coth  —  =  m  +  -  —     — 

m  3m  H-  5m  +  .  .  . 


(ii)  5  =  3.   Here 

tn  =  m 


where  3Ffc  =  S  a>fcy  exp  -   ,    a)  =  exp 
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Taking  as  initial  values 

o,     p.*,     p.2\        /I,     0,     0 

=   °>    i» 

0,        0, 


we  have  P0  =  F0,     00  =  ^ 

this  choice  of  initial  values  ensures  that  Jacobi's  algorithm  applied  to 
YQ  :  Y2  —  2mY1  :  Fx  will  give  "partial  quotients  " 

an  =  m2(3tt-l)(3r&-2),     bn  =  6m(n~l). 
(iii)  5  =  4.   In  this  case 

-  3)*n_1+  3wa(16na-  32n+  17)  ^w_2 


and  T0  =  ^0  Z0  +  ^(^  -  3m^2  -f  Qm2Zl)  +  t_2(Z2  ~ 


1,  l  +  3m-, 

0,           -1,  -9m 

0,       1,           3m,  -l  +  21m2 

L,      -1,  l-3m,  -l+9m-21m2/ 

a  system  which  yields  linearly  independent  pn,  qn,  rn)  tn,  since  its  deter- 
minant ==  —  8  =f=  0,  we  obtain 


Hence 


all  tend  to  zero  as  n~>co. 

The  following  numerical  results  for  m  =  2  give  some  idea  of  the  closeness 
of  the  approximation. 

t±  =  29,     ^2  =  48,907,     ^  =  387,366,095, 
«4  =  8,460,160,614,721,     ^5  =  393,500,185,153,048,349 
and  \pn-tne\t*n  =  0-522...,  0-343...,  0-262...,  0-215...,  0-184..., 

4  =  0-124,  0-077...,  0-057...,  0-045...,  0-032..., 
<*  =  0-237...,  0-155...,  0-118...,  0-096..,,  0-087..., 
^=  1,2,3,4,5. 
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(iv)  We  obtain  results  similar  to  the  above  by  putting 

a  —  sm,     an  —  m8~*(sn  —  l±)  (sn  —  12)  .  .  .  (sn  —  ls_i), 

where  ^,  ..  .,  I8^i  are  integers  incongruent  to  zero  and  to  each  other  modulo  s. 
As  an  example,  if  a  —  3m,  an  =  m2(3n—  1)  (3n+  1),  we  find  (omitting  a 
common  factor)  that 


where  Yk  is  as  defined  in  example  (ii)  above.  The  Jacobian  algorithm  applied 
to  Y2  :  Fj  +  ra2Fa  —  mY0  :  YQ  —  mYz  gives  "partial  quotients  " 


an  =  m2(3^-l)(3^+l),     bn  = 
(v)  If  we  put  s  ~  2,an  =  an  +  fi,  we  have 


(t  t')=(i  9- 

we  obtain  the  known  result  (6) 


(vi)  Taking  s  =  4,   a  =  2m,  an  =  m3(2?i+  l)(2w  +  /ff)(2w,  +  /?  + 1),  where 
m  and  m/?  are  positive  integers,  we  obtain 

Jn  =  m3(2n 
+  m2(^ 
Selecting 

2-l,      0,         1,  (2~fi)m 

•a+l,       0,          1,  (2-^)m 

m,  —  1, 

we  find  that 


all  tend  to  zero  as  n  ->  oo. 
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ON  THE  TOTAL  VARIATION  OF  A  FUNCTION 
OF  TWO  VARIABLES* 

CORRIGENDUM 

By  H.  P.  MtJLHOLLAND 
[Received  12  February  1948] 

The  corrections  detailed  below  are  necessitated  by  the  discovery  that 
a  certain  formula,  given  by  Saksf  for  the  area  of  a  continuous  surface 
z  =  f(x,  y),  is  not  correct  for  all  such  surfaces  without  restriction. 

(i)  In  Theorem  1,  on  p.  294,  at  the  end  of  the  last  sentence  but  one,  there 
should  be  inserted  the  restrictive  clause  "provided  tJwt,  in  the  case  of  the 
formula  (2-92),  the  corresponding  formula  for  the  area  of  the  surface  S(E)  is 
valid".  An  alternative,  and  equivalent,  restriction  ensuring  the  validity  of 
(2-92)  would  be  "provided  thatf(x,  y)  is  absolutely  continuous  in  L.  C.  Young's 


In  the  proof  of  Theorem  1,  at  the  end  of  §  12,  on  p.  310,  the  footnote 
reference  to  Saks  should  be  supplemented  by  one  to  L.  C.  Young's  paper}. 

*  Proc.  London  Math.  Soc.  (2),  46  (1940),  290-311. 

t  Theory  of  the  integral  (2nd  ed.)  (Warsaw,  1937),  v,  (8-4).  The  error  was  pointed 
out  by  Jarnik,  and  the  requisite  emendations  have  been  made  by  L.  C.  Young,  "An 
expression  connected  with  the  area  of  a  surface  z  =  F(x,  y)",  Duke  Math.  Journal,  11 
(1944),  43-57,  who  had  suggested  the  formula  in  the  first  place. 

J  Cf.  L.  C.  Young,  loc.  cit.,  especially  (3-7). 
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It  may  be  noted,  further,  that  the  formula  (2-91)  in  Theorem  1  and  the 
formula  (12- 31),  p.  309,  for  VR[f]  are  both  valid  without  the  new  restriction. 

(ii)  In  the  proof  of  Theorem  5,  the  part  of  a  sentence  appearing  on  p.  311, 
after  the  comma,  should  be  replaced  by  the  following: 

. . .  and  the  arguments  employed  by  Rado  and  Tonelli*  to  deal  with  these 
expressions  can  be  adapted  without  trouble  to  show  that  the  total  variation 
of  the  rectangle-function  |  yx(I)  \  over  R  is  equal  to  the  Tonelli  integral 
PF[0;/;  R],  and  similarly,  for  the  function/^,  cf.  §  1,  that 

W\0\f\R\  =  TF[0; /';#>] 

equals  the  total  variation  over  Re  of  |  yx(fe\  I)  |.  But  since  yx(fe\  I)  is  a 
continuous  and  additive  function  of  rectangles,  and  indeed  (after  an  obvious 
extension  of  its  definition)  of  general  polygonal  regions  also,  this  total 
variation  will  be  independent  of  the  orientation  of  the  rectangles  used  in 
the  underlying  dissections  of  Re :  it  must  thus  be  equal  to  the  total  variation 
over  R6  of  |  vjx(f9',  I0)  \ .  Now  Tjx(f0\  I6)  equals  the  component  of  the  vector 
ri(I)  in  the  direction  inclined  at  angle  —  0  to  the  #-axis:  let  us  denote  this 
by  ?/£(/;  /).  As  1°  varies  in  Re  the  corresponding  rectangle  /  will  vary  in  R, 
and  so,  altogether,  W[6\f\  R]  will  equal  the  total  variation  over  R  of 
I  y$(f'>  I)  I-  On  integrating  with  respect  to  6  over  [0,7r],  and  observing  that 
the  integral  of  the  total  variation  is  equal  to  the  total  variation  of  the  integral, 
we  see  that,  in  virtue  of  (1-5)  and  (1-6),  2FR[/]  must  be  equal  to  the  total 
variation  over  R  of  *„ 

|ifc(/;/)|«W=2|*(7)|, 

J  0 

as  required  for  the  theorem,  the  last  equality  resulting  from  the  definition 
of  ?<(/;/). 

CORRIGENDUM 

ON  SIMPLY  HARMONIC  "KAPPA  SPACES"  OF 
FOUR  DIMENSIONS 

[Proc.  London  Math.  Soc.  (2),  60  (1948),  317-329.] 

H.  S.  RUSE 
Page  327,  equations  (6-1),  (6-2).  The  Christoifel  symbols  I  M,  {  4J,  j^j, 

{3  ) 
}  are  also  not  identically  zero,  but  do  vanish  if  (6-3),   (6-4)   are 

satisfied. 

*  Cf.  S.  Saks,  Zoc.  cit.t  v,  §§  6-8. 


